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To Eric Beaurepaire 


Foreword 


In October 2018, as the last epiphany of a long-standing tradition, the 7th 
International School *Magnetism and Synchrotron Radiation", most commonly 
known as “Mittelwihr School of Magnetism”, gathered more than 100 young (and 
older) scientists from the whole Europe as well as from the USA and Japan. For the 
first time of this long series, one of the organizers of the very first Mittelwihr 
schools, the recently departed Eric Beaurepaire was not attending the school and, in 
many occasions, lectures were given with a very special attention to Eric’s 
pioneering work. This textbook is dedicated to Eric and can be regarded as a tribute 
to his scientific achievements. 

This version of the “Mittelwihr School" was not much different from the pre- 
vious ones so that the Lecture Notes contain most of the expected, basic ingredients. 
It starts with an introduction to the physics of modern X-ray sources (Chap. 1) 
coupled to a profound deep presentation of light/matter interaction in the X-ray 
range (Chap. 3) with a complete overview of all different types of angular 
dependence for X-ray absorption spectroscopy (Chap. 4). A broad and general 
introduction to magnetism (Chap. 2) is followed by an involved description of 
spintronics (Chap. 5) and physics of superconductivity (Chap. 6) with emphasis 
brought to how spintronics or superconductivity can be understood thanks to X-ray 
spectroscopies and X-ray scattering. 

The 7th International School *Magnetism and Synchrotron Radiation" took 
place in a context where many storage rings were either upgraded or looking 
forward to be upgraded. For most X-ray beamlines on third-generation storage 
rings, the vertical emittance of the electron beam was close to the diffraction limit 
imposed by the X-ray wavelength, i.e. 4/4z. Not much could be gained in this 
direction. On the contrary, for most 3rd generation storage rings, the horizontal 
emittance was much larger than the diffraction limit. At SOLEIL, the horizontal 
emittance of the electrons was 4000 pm rad so that it was more than one order of 
magnitude larger than the diffraction limit for X-rays with energy larger than 250 
eV. At ESRF, a similar situation existed. Reducing the horizontal emittance was 
then regarded as a target to follow in order to increase the brilliance and this was 
first obtained at MAX-IV by replacing the “long” dipole bending magnets by a 
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series of smaller dipole magnets yielding the multibend achromat lattice. Brilliance 
is defined as the number of photons per 0.196 bandwidth, per second, per surface 
of the source and per horizontal and vertical angular divergences of the source. 
Reducing the size of the source was an efficient way to increase the brilliance. 
Brilliance is a driving parameter for experiments where the photon source is 
focused on the sample and in such cases, the increase of brilliance is fully related to 
an increase of useful photons. It should nevertheless be kept in mind that for 
experiments where the beam is not refocused on the sample, the gain in brilliance is 
not accompanied by a high gain in useful photons: this is indeed what is observed 
for experiments where the figure of merit is the flux at the sample position (the flux, 
expressed in J m^? s~!, is the energy of all the photons crossing a surface unit per 
second). 

As a consequence of increasing the brilliance, the new X-ray beams of the 
diffraction-limited storage rings have a much higher level of coherence since one 
estimates that a source with size c and divergence c' has a 50% coherence for 7 
such that oo' = 4/47. Not regarding the controversy between brilliance and flux, 
the reduction of the emittance in the horizontal plane opens new perspectives for 
experiments taking advantage of coherence such as X-ray phase-contrast imaging, 
ptychography, Fourier transform holography, X-ray photon-correlation spec- 
troscopy, coherent diffractive imaging, or fluctuation microscopy. 

Chapter 1, “X-ray sources at large scale facilities", gives a brief introduction to 
the physics of storage rings with special attention to the physics of electron 
accelerators and to the devices developed to produce X-rays: bending magnets, 
wigglers, undulators. One can notice an introduction to the diffraction-limited 
storage rings that are presently blooming in Lund (MAX-IV) or Grenoble (up- 
graded ESRF) and that will replace the old, common structure of conventional 
storage rings. The X-ray free-electron lasers in conjunction with the self-amplified 
spontaneous emission of X-rays are also presented and the characteristics of theses 
new X-ray beams are described so that it is accessible to most, future potential 
users. Based on the knowledge of the X-ray beams, Chap. 3, "Electronic structure 
theory for X-ray absorption and photoemission spectroscopy" presents the con- 
nection between the electronic structure of matter and core-hole spectroscopies such 
as X-ray absorption spectroscopy and X-ray photoemission spectroscopy. I would 
like to attract the attention of the reader concerning the recently developed, state 
of the art pieces of theory that go beyond the common density functional theory 
(DFT). The time-dependent density functional theory, the Green's function 
approach (with special care to treatment of the self-energy within the GW 
approximation), or the Bethe-Salpeter equation are presented in a concise way so 
that the reader interested in learning beyond the DFT can get a flavour of these 
actual topics, still under development. Chapter 4, *X-ray dichroisms in spherical 
tensor and Green's function formalism", calculates the various types of angular 
dependence present in XAS. It first starts with a presentation of the multi-electronic, 
many-body calculation of the X-ray absorption cross sections. Then connections are 
made with the atomic multi-electronic approach as it is developed in the ligand-field 
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multiplet approach or with the monoelectronic DFT approach up to a brief survey 
of the many-body extended theories. The second part is dedicated to the angular 
dependence of electric dipole, E1.E1 and electric quadrupole E2.E2 terms, present 
in the X-ray absorption cross sections where the emphasis is brought to their tensor 
expansions. Various symmetries are considered, either octahedral or tetrahedral, as 
well as the presence of an external or molecular magnetic field along with the 
directions of high symmetry. The presentation is exhaustive but for the absence 
of the crossed-terms necessary for the calculation of X-ray natural circular 
dichroism and X-ray magneto-chiral dichroism. 

Chapter 2, “Concepts in magnetism”, is a nice and concise introduction to 
magnetism, necessary to make the present book a self-contained piece of work. The 
basic, theoretical framework is presented so that spectroscopists with no knowledge 
of magnetism can grasp the general picture. A special attention is devoted to the 
role played by the orbital part in the building of the magnetic properties. This 
emphasis needs to be acknowledged because the X-ray/matter interaction can be 
fully described in the electric dipole and electric quadrupole approximations, where 
no spin variables are present, so that X-ray absorption can be highly sensitive to the 
orbital part of the electron wave function and more specifically to the orbital 
magnetic moment. 

Chapters 5 and 6 are descriptions of special states of matter where chemists and 
physicists are tailoring matter so that it presents original, unprecedented charac- 
teristics. In the first part of Chap. 5, “Spintronics and synchrotron radiation", the 
authors give an overall view of the concept of magnetoresistance and they explore 
how magnetization can be triggered in the absence of an external magnetic field, 
with a special note on spin-orbitronics and the fascinating skyrmions. Then the 
authors present the information that can be gained by advanced X-ray spectro- 
scopies such as angle-resolved photoemission spectroscopy, X-ray magnetic cir- 
cular dichroism and scanning transmission X-ray microscopy. In Chap. 6, “p-wave 
superconductivity and d-vector representation”, the author presents advanced 
concepts in the field of superconductivity. Through a special attention paid to the 
role played by the orbital part of the electrons responsible for superconductivity, the 
reader can make a connection with the information that can be extracted from 
advanced, polarized X-ray spectroscopies. 


Paris, France Ph. Sainctavit 


Preface 


This volume contains the lecture notes of the seventh school on Magnetism and 
Synchrotron Radiation held in Mittelwihr, France, from 7 to 12 October 2018. 

This school has been contributing since 1989 to federate the community which 
uses synchrotron radiation for magnetic investigations, starting with first-generation 
synchrotrons, until today where new opportunities open up with the access to 
free-electron lasers. We would like to dedicate this school to the late Eric 
Beaurepaire (1959-2018) from IPCMS-CNRS for his tribute to the existence of this 
school and to the science of magnetism. He was one of the founders and passionate 
supporter of this school. He set several milestones in the field of femtomagnetism, 
first using pump-probe laser experiments to unravel the magnetization dynamics 
with a founding paper in 1996, and several years later, making use of time-resolved 
experiments with synchrotron radiation. We will also remember him as a simple 
and very human person. 
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Chapter 1 A) 
X-Ray Sources at Large-Scale Facilities S 


Philip R. Willmott 


Abstract High-brilliance X-ray sources are powerful probes to investigate the prop- 
erties of matter down to the sub-angstrom scale and on time scales that can extend 
below a femtosecond. In this chapter, an introductory overview of the physics behind 
storage ring-based synchrotrons and linear accelerator-based X-ray free-electron 
lasers is presented, while the properties of the radiation they produce are explained. 


1.1 Introduction 


Since their discovery by Wilhelm Róntgen in the last decade of the nineteenth century, 
X-rays have played a central role in all branches of the natural sciences and medicine. 
The primary reasons for this are threefold. Firstly, ‘hard’ X-rays (that is, those with 
photon energies in excess of a few keV and up to approximately 50 keV, see Fig. 1.1), 
have interaction strengths that, on the one hand, are small enough to allow them to 
penetrate deeply into solid matter, while, on the other, are sufficiently large that 
these interactions are easily observable. Secondly, X-rays have wavelengths A on 
the nanometre to angstrom scale, meaning that, according to the Abbe limit, they 
can be used to image objects composed of features down to sizes comparable to À. 
Finally, the binding energies of electrons, from weakly bound valence electrons to 
very strongly bound core electrons in heavy elements, lie in the ultraviolet to hard 
X-ray regime, allowing detailed studies of these bonds through spectroscopy, thereby 
providing insights into chemistry, electronic structure and magnetic properties. 
Modern scientific disciplines are increasingly concerned with correlating physi- 
cal structure with physical properties. This has been long recognized in the lock- 
and-key functionality of biological structures such as enzymes and proteins. In 
condensed matter physics, the properties of many emergent materials, in particu- 
lar (though not exclusively) transition metal oxides, depend exceedingly sensitively 
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Fig. 1.1 Photon energies and the electromagnetic spectrum. Above the visible regime, spanning 
only an energy range of 1.77-3.1 eV, the electromagnetic spectrum is divided into UV (up to 
approximately 140 eV), soft X-rays (up to 2 keV), tender X-rays (to 4 keV), hard X-rays (to 
50 keV) and gamma rays (above 50 keV). Important photon energies are highlighted, green (K - 
edge) and red (L-edge) arrows pointing down imply absorption energies, while those pointing up 
imply emission lines 


on the structure—even changes of a few picometres in bond length, or one degree in 
bond angle, can have fundamental consequences on the material's electronic char- 
acter [1]. 

Protein crystals can often only be grown with linear dimensions of a few tens 
of micrometres, while interfacial regions between different oxide materials which 
exhibit unexpected properties [2] may only be a few nanometres thick. Any signal 
from irradiation of such samples using X-rays, be it the degree of absorption, the 
amount of elastically scattered radiation, fluorescence or the photoelectron yield, is 
likely to be very weak. This sets a premium on finding a very intense X-ray source— 
synchrotrons and X-ray free-electron lasers (XFELs) have been developed to fulfil 
this need. 

The broad range of applications of X-rays has manifested itself in the last two 
decades in the heterogeneity of scientific fields served and the broad spectrum of 
techniques now available at synchrotron facilities, representing perhaps the clearest 
illustration of multidisciplinary research, covering the natural and medical sciences 
and many aspects of engineering and technology. Nowadays, there are worldwide 
more than seventy facilities in operation, or under construction, providing sophisti- 
cated investigative tools for well over 110 000 users from virtually every field of the 
natural and engineering sciences. 

These users need to understand the operating principles of synchrotrons and their 
generation of X-radiation in order to best prepare, firstly, proposals to be submit- 
ted to the highly competitive review procedure at synchrotrons, and secondly, the 
beamtimes themselves. This brief overview of synchrotrons, synchrotron sources, 
and XFELs draws substantially from chapters on synchrotron and XFEL physics in 
[3] and as such is intended as an accessible primer to the interested reader from any 
branch of the natural and engineering sciences. 

In the next section, the architecture and operating principles of synchrotrons are 
described and the standard figure of merit for synchrotron light, called the brilliance, 
is introduced. The three different types of source (bending magnets, wigglers and 
undulators) are presented in Sect. 1.3. We finish this section by describing ways to 
control the polarization of X-rays at synchrotrons. 
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Sections 1.4 and 1.5 outline the most pertinent features of the latest generation of 
storage rings, so-called diffraction-limited storage rings (DLSRs), and X-ray free- 
electron lasers (XFELs), respectively. 


1.2 A Brief Description of Synchrotrons 


1.2.1 Introduction 


A synchrotron consists of a ring-shaped evacuated vessel (the storage ring, having 
a circumference measured typically in a few hundreds of metres, Fig. 1.2) in which 
high-energy electrons circulate at highly relativistic velocities, and so-called *beam- 
lines’, that extract and use the radiation emitted by the electrons tangentially to their 
orbital path, at positions defined by components known as bending magnets (BMs) 
and insertion devices (IDs). 

The electron energy € at synchrotrons is typically of the order of a few GeV. The 
emitted photons, on the other hand, have energies measured anywhere between a few 
eV (just above the visible) and several hundred keV, in the ultrahard X-ray regime.! 
Note, however, that even the photon energies of the latter are still some four orders 
of magnitude smaller than the electrons’ kinetic energy € in the storage ring. 

The electrons are forced into a closed path by bending magnets, which exert a 
centripetal Lorentz force on them. It is here, and in straight sections in which the 
insertion devices are installed (see Sect. 1.3), that they emit electromagnetic (EM) 
radiation. 

The electrons lose energy due to their emitting EM radiation. This must be replen- 
ished and is achieved via axial acceleration through one or more radio-frequency (RF) 
cavities installed in the storage ring. 


1.2.23 The Lorentz Factor 


Before we proceed further, the dimensionless parameter y is introduced. This so- 
called ‘Lorentz factor’ expresses the ratio of the electron energy € to the rest mass 
energy of the electrons mec? = 511 keV (m, = 9.109 x 107?! kg is the electron rest 
mass and c = 2.9979 x 108 ms™! the speed of light), that is 


E 


Mec? 


ys (1.1) 


‘Some facilities host beamlines that extend down into the far-infrared regime; these relatively 
uncommon sources are not discussed here. 
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Fig. 1.2. A schematic of the basic components of a modern synchrotron facility. Electrons from a 
source such as a heated filament in an electron gun are accelerated by a linear accelerator (LINAC) 
and then injected into a booster ring, where they are further accelerated. They are then further 
injected into the so-called storage ring. There, they are maintained in a closed path using bending- 
magnet achromats at arc sections. The beamlines use the radiation emitted from insertion devices 
(IDs, either wigglers or, more commonly, undulators) placed at the straight sections between the 
arcs, and from the bending magnets (BM), on the axes of emission. The energy lost by the electrons 
through the emission of synchrotron light is replenished by particular parts of the cycle of one or 
more radio frequency (RF) supplies. This forces the electrons within the storage ring to separate 
into discrete bunches. Each bunch contains of the order of 10? electrons and has a full width at half 
maximum duration of the order of 100 ps. Reproduced from [3] with permission (Copyright 2019, 
John Wiley and Sons) 


Consequently, for typical storage ring energies of the order of a few GeV, y is of 
the order of a few thousand to a little over 15 000 (for the Advanced Light Source 
in Berkeley, £ — 1.9 GeV and hence y — 3718, while the highest energy storage 
ring, SPring8, has a storage ring energy of £ — 8 GeV, leading to y — 15656). The 
Lorentz factor crops up in many equations related to synchrotron radiation (SR), 
including the beam divergence, relativistic electron mass, electron emittance and the 
radiative power output. 
From the special theory of relativity, it emerges that 


-1/2 
y-[-oeTp ^, (1.2) 
where v is the electron's velocity. We re-arrange this to obtain 


v2c(1-1/p?) ^, (1.3) 
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We know y is of the order of several thousand, hence 1/y? is a very small number, of 
the order of 1075. We use the approximation (1 — x)" — 1 — nx forx « 1 to obtain 


C—y 1 
Pa, 1.4 
P 2y (1.4) 


In other words, the difference between c and v is very small, of the order of a few 
ms-!. 

Lastly, the mass of the electron from the perspective of a stationary observer is 
equal to yme. We will return to these findings later. 


1.2.3 Dipole Radiation and Synchrotron Radiation 


Why do electrons emit EM radiation at all? First, it should be stressed that elec- 
trons, or indeed any charged particles, only emit EM radiation when accelerated. 
‘Accelerated’ can include the conventional meaning of an increase in speed but no 
change in direction; its opposite, that is, a deceleration; or a change in the electrons’ 
direction, such as in centripetal acceleration. The second case corresponds to the so- 
called ‘Bremsstrahlung’ (German expression for braking radiation), while the third 
is associated with SR. 

Given this, why then does the action of accelerating electrons cause them to emit 
light? Consider Fig. 1.3. EM radiation is a form of transverse wave in which the 
oscillations (of the electric and magnetic fields) are at right angles to the direction of 
motion. For the sake of simplicity, we consider here only the electric field component 
of the EM radiation. The electric field lines of the electrostatic field of a stationary 
and isolated electron emanate out radially from the electron. Any observer looking at 
the electron, therefore, sees no transverse component of the field, which thus implies 
she sees no radiation [Fig. 1.3a]. 

If, however, the electron is made to execute oscillatory motion, the electric field 
lines, which are anchored to the electron and emanate out from it at the speed of 
light, will reflect this motion and thus also oscillate accordingly [Fig. 1.3b]. In all 
directions except that exactly along the axis of acceleration, our observer will ‘see’ 
a transverse component to the electric field and therefore perceive that light is being 
emitted. The amplitude of the radiation is proportional to cos x, where x is the polar 
angle between the axis of acceleration and the observer's direction. The intensity 
distribution, shown in Fig. 1.3b, is proportional to cos? x. This so-called ‘dipole 
radiation' is the reason why mirrors reflect visible light, radio antennae capture or 
emit radio waves and undulators produce X-radiation. 

SR is highly collimated, with natural divergences of the order of 0.1 mrad (mrad; 
1 mrad is approximately equal to 0.06?). A detailed derivation of the spatial distri- 
bution of SR lies beyond the brief of this introductory overview (see, for example, 
[3]). Suffice it to say, it differs substantially from the dipole distribution shown in 
Fig. 1.3b; this is due to relativistic Doppler shifting. The angular power distribution 


6 P. R. Willmott 


cos ot 
a 4 4 
(b) ( € Ka 
» * 
» n 
Y 
We. r 
~ » 
4 -— 
P 
3 = 
^4 -Q+ | -DA 
p 
- 
^ 
^ ~ 
p 
— P 
> 
Y d ià 
, a 
r 
(ccc 
VB 


Fig. 1.3 Generation of EM radiation through the acceleration of charged particles. a A charged 
particle at rest or moving at a constant velocity will not emit light, as an observer of the particle will 
detect no lateral component of the electric field lines. b If, however, the particle is accelerated, an 
Observer positioned anywhere except along the axis of that acceleration (position A) will experience 
a shift in the position and direction of the electric field lines as the event horizon washes over them 
at the speed of light (for example, at position B). A simple harmonic driving force will generate 
radiation from the electrons at the same frequency. Adapted from [3] with permission (Copyright 
2019, John Wiley and Sons) 


per solid angle dP/dQ for an electron moving with a velocity v is given without 
derivation as 


(1.5) 


dQ “Q fcos0y y^ — Bcos6)2 


dP a | sin? 0 cos? $ | 
where $ and 0 are the polar (out of the orbital plane) and azimuthal (in the orbital 
plane) angles, respectively, « = e?/(167z?€oc) = 6.124 x 107? kgm? s~! and a = 
Bec/(ym,) is the acceleration perpendicular to the direction of motion due to the 
Lorentz force exerted on the relativistic electron by the magnetic field B. Note that in 
the nonrelativistic limit of v «& c (B = v/c < 1), (1.5) reduces to the simple cos? 0 
dependence of dipole radiation. The progression from dipole to synchrotron radiation 
for different values of 6 is shown in Fig. 1.4). 

For a given centripetal acceleration a, the maximum power (in the forward direc- 
tion) scales with the fourth power of the electron-beam energy. At highly relativistic 
velocities, it emerges that 


2 
dP 1- (y0y 
a cp LL O9] (1.6) 
m [1+ (6?] 

where x! = e*/(2x?m2egc) = 1.5156 x 107^ m? C? kg^! s7!) (or WT’). In the 
frame of reference of the electron, the distribution remains pure dipole radiation 
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Fig. 1.4. From dipole radiation to synchrotron beams. The progression of the angular power dis- 
tribution of radiation from an electron travelling at a fraction of the speed of light 6B = v/c while 
experiencing a centripetal acceleration a perpendicular to its motion. The case v = 0 corresponds 
to dipole radiation. As £ increases, the radiation distribution is swept in the forward direction 
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(B = 0 in Fig. 1.4) and the opening angle is 6’ = +x /2. From the perspective of a 
stationary observer, however, the angular distribution is modified such that 


0 = sin! Ez (1.7) 
y (1 + B cos 0?) 


Thus, the opening angle changes from +7 /2 in the electrons’ frame of reference to 
+1/y in the laboratory frame. The entire beam therefore lies within +1/y, and has 
a full width at half maximum of approximately 1/y. This ‘natural opening angle’ 
(or divergence) of the narrow radiation cone, for typical storage ring energies of 
1—8 GeV, is equal to 0.5—0.06 mrad (0.028—0.0034°), respectively; SR is highly 
collimated. 

Lastly, it can be simply demonstrated from (1.2) and (1.5) that, for a given accel- 
eration a, the ratio of the power in the forward direction (0 — 0) for an electron beam 
travelling at a non-zero speed perpendicular to a, to that of the maximum of dipole 
radiation for an electron with zero velocity perpendicular to the acceleration is 


P@=0,6840) A-B) ,, 
P(0—0,8—0) -y» ~ 8y (1.8) 
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where the approximation on the right is valid for relativistic velocities. Both the 
exact and approximate expressions are shown in Fig. 1.5 up to 8 = 0.9999. Increases 
in power of the order of 10/6 compared to electrons undergoing stationary dipole 
oscillations can thus be expected—synchrotrons truly do deliver powerful beams! 


1.2.4 Spectral Flux, Emittance, and Brilliance 


Flux and brilliance are figures of merit of the quality of a synchrotron facility. The 
spectral flux is defined as the number of photons per second per unit bandwidth 
(BW), normally given as 0.146, and is the appropriate measure for experiments that 
use the entire, unfocussed X-ray beam. Brilliance essentially states how tightly the 
spectral flux is collimated and how small the source size is. It is defined as 


photons/second 


; (1.9) 
(mrad)^ (mm? source area) (0.1% BW) 


Brilliance — 


and is therefore equal to the flux per unit source cross-sectional area and unit solid 
angle. Note that the flux (as against the spectral flux) is simply measured in photons 
per second. Doubling the transmitted BW from a broadband source thus doubles the 
flux, but leaves the spectral flux unchanged. From (1.9), it is seen that the brilliance is 
inversely proportional to both the source size and the beam divergence. The product 
of the linear source size c and the beam divergence o’ in the same plane is known 
as €, the emittance in that plane, that is 


€; = 040, , (1.10) 
(1.11) 


The goal of the machine physicist designing a synchrotron magnet lattice is to provide 
as low an emittance as possible, in other words, a source with an exceedingly small 
cross section emitting X-rays that are highly collimated. For a given synchrotron 
storage ring, the emittance in each transverse direction (x, in the orbital plane, or y, 
perpendicular to the orbital plane) is, according to Liouville's theorem, a constant. 
The emittance will be different for different facilities, in each case being determined 
primarily by the degree of sophistication and perfection of the magnet lattice. 

Importantly, the total emittance in a given plane is a convolution of the contribution 
from the electron beam and that from the emitted photons. It follows that the total 
source size o,,, and divergence o; , in the x- and y-planes perpendicular to the 
direction of propagation of a given storage ring are also convolutions of contributions 
from the electron and photon beams, that is 


ox y = [0E +P] , (1.12) 


/ le /p 1/2 
of 7 [G5 * oy] ^ . (1.13) 
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While the electron contribution can be minimized by sophisticated electron optics 
(see Sect. 1.4), the photon emittance is an intrinsic property defined by Heisenberg's 
uncertainty principle, and is equal to A/4z in both the x- and y-plane. In third- 
generation storage rings, the electron emittance in the orbital plane e? dominates the 
total emittance and is thus the limiting factor for the brilliance. In fourth-generation 
DLSRs, the benchmark for modern storage ring designs, e? has been reduced to values 
close to or even below (in the case of soft X-rays) the intrinsic photon emittance, 
which we consider in detail in Sect. 1.4. In other words, the emittance is no longer 
limited by the electron optics, but by the fundamental optical diffraction limit. This 
is the meaning of the moniker 'diffraction-limited storage ring’ defining the fourth- 
generation synchrotron facilities now coming online. 

Synchrotrons have brilliances using modern undulators of approximately 10?? pho- 
tons s~! mrad? mm"? 0.196 bandwidth"! . This is some 12 orders of magnitude higher 
than that of a standard laboratory-based Cu K o source and less than a factor of 100 
lower than high-quality visible laser sources. The main reasons for this are the size 
of the radiation source, of the order of ten micrometres at fourth-generation DLSRs, 
the high collimation of the beams, being of the order of 10 rad in the orbital plane 
and the fact that synchrotrons emit an enormous amount of light. The power emitted 
by an electron is proportional to the square of the electron's centripetal acceleration 
a, and the fourth power of the storage ring energy. 


1.2.5 The Radio-Frequency Power Supply 


Conservation of energy dictates that the kinetic energy of the electrons is dissipated 
due to emission of radiation at the bending magnets and insertion devices. This 
energy must be replenished, or otherwise, the electrons would spiral into the inner 
wall of the storage ring. This is achieved by boosting the electrons' energy at one or 
more positions along the storage ring as they pass through RF cavities [Fig. 1.6a]. 
This requires that the electrons enter the cavity at a certain point of the RF cycle. 

Because the electrons can only receive the correct amount of energy at very 
specific and narrowly defined values in the RF cycle, they are separated into a series 
of packets, or *bunches'. The energy loss of the electrons for each cycle around the 
ring is given by the total power loss of the storage ring divided by the storage ring 
current and is equal to approximately 0.2-1 MeV, or of the order of 0.05% of the 
nominal electron energy. Depending on the size of the facility, most storage rings 
host between two and eight RF cavities. Between them, they must be able to replenish 
this loss. 

Consider Fig. 1.6b. On average, the electrons require a certain energy boost in 
order to keep them on a stable path, given by an amount eV,ef. If an electron loses 
more than this amount of energy, it will enter the RF cavity somewhat earlier at 
point A. This might sound counterintuitive—surely if the electron has less energy, it 
will be slower and enter the cavity later. But because it takes a shorter path in the bends 
according to the linear dependence of the bending radius and the electron energy it 
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Fig. 1.6 Replenishing the electron energy in a storage ring. a Electrons entering the resonant RF 
cavity at the correct moment in its voltage cycle are accelerated by a suitable amount by the electric 
field within the cavity. Note that the field lines point in the opposite direction to the acceleration, as 
the electric force is Fg = qE but an electron has a negative charge —e. b ‘Slow’ electrons entering 
the RF cavity at A will be given more of a boost than ‘fast’ electrons at B. Adapted from [3] with 
permission (Copyright 2019, John Wiley and Sons) 


indeed will arrive earlier than a higher energy electron, and will thus experience a 
larger acceleration than if it were at the reference voltage. Likewise, if the electron is 
too fast, it will receive less of a boost. Any electrons entering the RF cavity outside 
this narrow range above and below the reference voltage will not gain the correct 
energy and will be lost to the system. The electrons therefore quickly bunch into 
packets associated with each cycle of the RF cavity. 

The short bunch lengths allow users to exploit the time structure of SR down to 
well below the nanosecond time scale for time-resolved experiments. These types 
of experiment became increasingly important in third-generation synchrotron facil- 
ities, in areas as diverse as molecular biology, catalysis, condensed matter physics 
and domain flipping in nanomagnetism and, for DLSRs, they promise to be comple- 
mentary to XFEL investigations. 


1.2.6 Radiation Equilibrium 


What determines the electron emittance? The emittance of a storage ring is deter- 
mined by the opposing influences of two phenomena: radiation damping (some- 
thing you want) and quantum excitation (something you don't). At the NSLS II in 
Brookhaven, the machine performance is optimized by maximizing radiation damp- 
ing, while in the next generation of DLSRs such as MAX-IV, quantum excitation 
has been minimized. As we have already stated, radiation damping improves the 
emittance by reducing the transverse momentum component [see Fig. 1.7a]. When 
an electron emits a photon, it loses the energy of the photon. This causes it to oscil- 
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Fig. 1.7 Radiation equilibrium between quantum excitation and radiation damping. a Radiation 
damping. An electron traversing a magnet in an insertion device is made to deviate from the central 
axis by an angle 8, due to the Lorentz force. Emission of a photon with momentum A/4 = hk 
will be in the direction of the electron at that instant in time. Conservation of momentum dictates 
that the electron’s momentum will be reduced to p' = p — A/A. The same electron will regain this 
momentum loss dp after travelling through the RF cavity; importantly, this will now be parallel 
to the central axis, thus reducing the angle of the electron's momentum p” to the central axis and 
hence also the electron beam's emittance. b Quantum excitation. An electron loses energy due to 
the emission of a photon and begins to oscillate around a new reference orbital path with a smaller 
radius. These oscillations induce a stochastic distribution of transverse momenta, thereby increasing 
the emittance. Reproduced from [3] with permission (Copyright 2019, John Wiley and Sons) 


late around a new reference orbit, thus broadening the beam and thereby increasing 
the emittance [Fig. 1.7b]. Moreover, the dispersion of the electron beam increases. 
Quantum excitation can be reduced by designing the magnet lattice so that the elec- 
tron energy dispersion is minimized at the main locations of radiation, namely the 
bending magnets. This is achieved by horizontal focusing at the bends and the use 
of many small deflection angle bends in multibend achromat lattices (see Sect. 1.4) 
to limit dispersion growth. 


1.2.7 Coherence 


We now consider coherence, including both longitudinal and transverse coherence. 
The latter depends on the source size and divergence of the photon beam, in other 
words, it depends on the emittance; longitudinal coherence depends on the band- 
width. The coherent fraction of a beam is critically important in lensless imaging 
techniques and photon correlation spectroscopy, plus also in phase-contrast tomog- 
raphy. Note also that bound up in the figure of merit of brilliance are the above 
parameters that quantitatively define coherence: the emittance and the relative spec- 
tral BW. Figure 1.8 provides a schematic summary of coherence. Brilliance really 
does encompass the most important qualities of synchrotron light; because however, 
it combines flux, spatial coherence and longitudinal coherence, it is important to 
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Fig. 1.8 Coherent radiation. The coherent fraction of a broadband, spatially extended, source can be 
selected as follows. Firstly, a pinhole selects a small, spatially constrained fraction of the radiation 
thereby acting as a secondary, quasi-pointlike, source. This secondary source has an improved 
transverse (or spatial) coherence, as the emittance, which is the product of the radiation’s divergence 
and source size, is now much smaller. Next, a filter (which is normally a monochromator) selects 
a narrow BW which is much narrower than the original source. Now, the radiation is spatially and 
longitudinally (or temporally) coherent. Note that both the emittance and relative spectral BW are 
included in the definition of brilliance. Reproduced from [3] with permission (Copyright 2019, John 
Wiley and Sons) 


know which factors determine a given value. For example, is the brilliance high 
because there are more photons on the sample, or because the emittance is small? 

No X-ray source has an infinitely narrow bandwidth. Consequently, the different 
frequency components within the beam will sooner or later drift out of phase with 
one another. The time for the phase between two waves differing in frequency by 
an amount Av but which are initially in phase to differ by z radians (i.e. from fully 
constructive to fully destructive) is simply 1/2Av. This is known as the so-called 
longitudinal coherence time, At{”. During this time, the waves have travelled in 
vacuum a distance] = cAc(P /2, known as the longitudinal (or temporal) coherence 
length [see Fig. 1.9a], given by 


2 
[D =, (1.14) 
2AX 
The longitudinal coherence after a monochromator is usually determined by the rock- 
ing curve of the crystal or grating used in the monochromator, which defines A./ AA. 
For a perfect crystal with insignificant mosaicity, AA is limited by the so-called Dar- 
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win width, and A/Ad can easily exceed 10^—the relative bandwidth of a Si(111) 
single crystal is determined by the extinction depth and is approximately 1.4 x 1074. 
The longitudinal coherence length can thus be several micrometres, even in the hard 
X-ray regime. 

The transverse coherence length 1 (also called the spatial coherence length) 
results from the interference of waves having the exact same wavelength but with 
slightly different directions of propagation. This arises because all sources have a 
finite size D and a non-zero divergence A0 (that is, a non-zero emittance), as shown 
in Fig. 1.9b. In this case, 


I? = 1/2A0 2 AR/2D , (1.15) 


where D is the linear size of the finite source and R is the distance from the source to 
the observation point. If we assume the source has a Gaussian profile, determination 
of D requires integration of interference contributions across the entire source's 
intensity distribution. It emerges that / is related to the standard deviation of the 
beam size o,,, by 


I" = AR/27 aes) , (1.16) 
or, in practical units 
AAR 
Wg = 2801 E, (1.17) 
Ox,y [um] 


Note that the transverse coherence length can be made larger by judicious use of slits 
limiting the apparent source size and divergence, but obviously at the cost of flux. 
Beamlines such as coherent lensless imaging beamlines tend to be very long in order 
to maximize R. 

In the vertical direction, the source size at an undulator of, say, 2-m length, is of 
the order of o, = 2 jum. For 1-À radiation, this yields a vertical spatial coherence 
for an observer at 40 m of ie» = 564 um. The horizontal spatial coherence has 
traditionally been two orders of magnitude smaller than this, due to the very much 
larger extent of the electron beam in the orbital plane. The electron beam source 
size in DLSR storage rings in the horizontal direction may, however, be as small as 
ox = 10 jum; the corresponding coherence length /(^? is of the order of 0.1 mm. 


1.3 Sources of Synchrotron Radiation 


In this section, the three sources of SR are semi-quantitatively described, while the 
differences between the radiation produced by bending magnets and wigglers on the 
one hand, and undulators on the other, are presented in a heuristic manner. 
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Fig. 1.9 Beam coherence. a The temporal, or longitudinal, coherence length is determined by the 
monochromaticity of the source, while b the transverse, or spatial, coherence length depends on the 
photon beam emittance. Reproduced from [3] with permission (Copyright 2019, John Wiley and 
Sons) 


1.3.1 Bending Magnets and Wigglers 


As mentioned above, the electrons are maintained on a closed path via magnetic, or 
Lorentz, forces. The Lorentz force, Fz, is proportional to the cross-product of the 
magnetic field strength, B, and the charged particle's velocity, v, that is 


F —eBxv, (1.18) 


where e = 1.6022 x 107! C is the elementary charge. It acts perpendicular to the 
plane defined by B and v. For the sake of simplicity, we only consider those cases 
where Fz, B, and v are mutually orthogonal, and drop the bold face implying their 
vectorial nature. Now, 


F; —eBv. (1.19) 


We equate this with a centripetal force mv? / p, where m — ym, is the relativistic mass 
of the electron travelling at a speed v ~ c. The bending radius of the centripetal force, 
p, is equal to the bending magnet radius. Therefore, to a high degree of accuracy, 


2 


uod d em (1.20) 
p 
E 
=—. 1.21 
=p ceB qn 
In practical units, we obtain 
E[GeV 
mec l, (1.22) 
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Fig. 1.10 Double-bend achromats. The angular dispersion of electrons of different energies as they 
pass through a bending magnet and the consequent increase in the electron beam emittance can be 
corrected in a DBA by placing a focusing quadrupole magnet (FQM) symmetrically in between 
two identical bends (BM). Lower energy electrons are bent through larger angles than are those of 
higher energy. Adapted from [3] with permission (Copyright 2019, John Wiley and Sons) 


For typical storage ring energies of a few GeV and magnetic field strengths of the 
order of 1 T, we obtain bending magnet radii of the order of 10 m. 

The relative spread in electron energy A£/£ in a storage ring is of the order 
of 10?. The bending radius p is directly proportional to the electron energy [see 
(1.21)]. Therefore, the path of those electrons with more (less) than the central energy 
will have a larger (smaller) radius, resulting in an unwanted increase in emittance. 
This problemis resolved by using an arrangement of bending and quadrupole magnets 
is known as a double-bend achromat (DBA, also called a Chasman-Green lattice, 
after its inventors [4]), shown in Fig. 1.10. 

The natural (i.e. minimum) horizontal electron emittance of a DBA with bending 
angle 20 (that is, 0 for each dipole pair) is given by 


€x,DBA = Cosa y 0? , (1.23) 


where Cppa = 114/5h/384 mec = 2.474 x 10 nm [5]. So, for example, the lower 
limit emittance of a 3 GeV storage ring containing 20 DBAs would be 3.3 nmrad, 
larger by nearly three orders of magnitude than the photons' diffraction-limited value 
of A/4 = 8pmrad calculated for 1-À radiation. Efforts to approach the diffraction 
limit by using multibend achromats (MBAs) are discussed in Sect. 1.4. 

The primary purpose of bending magnets is to maintain the electrons in the storage 
ring on a closed path. Bending magnets have typical magnetic field strengths of the 
order of 1 T. They produce bending magnet radiation in a flattened cone with a 
fan angle equal to that swept out by the path of the electrons due to the Lorentz 
forces they are subjected to. The relatively large subtended angle of bending magnet 
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Fig. 1.12 Bending -magnet 
spectra for on-axis 
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radiation for three different 
combinations of the storage 
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radiation, measured in degrees, allows one to accommodate more than one so-called 
*bending-magnet beamline’ at a single bending magnet. 
The spectral flux distribution is given by 


ph/s 


EM E 
(rad)? 0.150 BW = 1.325 x 1019 5, [mA]£? [GeV] (=) kal) , (1.24) 
s ‘0 » 


2Ec 


where E, = 3heB y? /2m, is the so-called critical, or characteristic, energy (which 
in practical units can be expressed as E,[eV] = 665.023 B[T] € ?[GeV?]), and 
K»j3(x = E/2E,) is the modified Bessel function of the second kind for nonintegral 
order (in this instance, 2/3), shown in Fig. 1.11. 

The spectral flux is thus determined by the storage ring energy and the magnetic 
field strength. Increasing B (but keeping € constant) shifts the maximum of the 
spectrum to higher photon energies but does not increase the spectral flux’s maximum. 
In contrast, only increasing € both shifts the spectral maximum to higher photon 
energies and higher values. 
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The broadband bending magnet spectra for three different combinations of storage 
ring energy and magnetic field strength are shown in Fig. 1.12. Particularly at low- and 
medium-energy facilities, the maximum values of both these quantities are too small 
to extend the spectrum far into the hard X-ray regime. However, if superconducting 
magnets with larger magnetic field strengths are employed, the photon energy range 
can be extended to harder X-radiation, as the critical energy is proportional to the 
magnetic field strength. Moreover, the radiative power increases with the square of 
the magnetic field strength B. These so-called *superbends' can have magnetic field 
strengths as high as 8 T. 

There are two types of insertion devices, distinguished from each other by the 
amount that the electrons are forced to deviate in a slalom-like path from a purely 
straight path. This at first seemingly subtle distinction has a fundamental effect on 
the nature of the radiation, however. For angular excursions substantially larger than 
the synchrotron radiation's natural opening angle y ^, the radiation cones from each 
magnet in the insertion device do not overlap. Under these conditions, the intensities 
produced from each dipole are added and the ID is referred to as a wiggler, which is 
briefly described below. 

For gentler excursions of the order of y ^ ! the ID is called an undulator, described 
in Sect. 1.3.2. 

The maximum angular deviation max of the electron oscillations in an ID is 
defined by the dimensionless *magnetic deflection parameter' K, given by 


Pmax = K/y . (1.25) 
K can be expressed in terms of the maximum magnetic field Bo as 


e Bo 
K = = 0.9342, , [cm] Bo[T] , (1.26) 


MeCKu, w 


where i, Or A, are the periods of the oscillations in the undulator or wiggler, respec- 
tively, and ky», = 2z:/A,,,. For a wiggler, K is typically between 10 and 50, while 
for undulators, K is close to unity and changes according to the size of the gap 
between the upper and lower magnet arrays. The horizontal spread in the electron 
beam divergence is 


0, — 2K[y . (1.27) 


So, for example, a wiggler having K — 20, operating in a 4 GeV storage ring would 
have a horizontal divergence of 5.2 mrad (0.30?). 

A wiggler can be thought of as being a series of bending magnets within a straight 
section of the storage ring that turns the electrons alternately to the left and to the 
right. The maximum angular excursion from the central axis is larger than the natural 
opening angle of the radiation, y~!. For each oscillation, the electrons are twice 
moving parallel to (and in reality also very close to) this axis. The radiation is therefore 
enhanced by a factor of 2N, where N is the total number of wiggler periods and is of 
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Fig. 1.13 Comparison of brilliances at a 3 GeV DLSR running at 400 mA between a U14 undulator 
with K — 1.6 (blue), a bending magnet with B — 1.41 T (red), a superbend with B — 4 T (yellow), 
and a wiggler with the same field strength as the bending magnet and 100 periods (green). Note 
that the peak brilliance of synchrotron sources can be calculated from the average brilliance shown 
in this figure by multiplying by the ratio of the pulse separation At ~ 5 ns to the pulse width 
At ~ 50 ps, that is, approximately a factor 100. The peak brilliance of XFELs is of the order of 
1074 phs-! mm^? mrad ? 0.196 BW7!, some ten orders of magnitude greater than that produced by 
DLSR undulators. Reproduced from [3] with permission (Copyright 2019, John Wiley and Sons) 


the order of 20. The spectrum from a wiggler has the same form as that from a bending 
magnet—it is broadband and thus produces a large amount of integrated radiative 
power, of the order of several kW. Thermal management of optical components 
is thus critical. Wigglers are therefore becoming fairly uncommon, particularly in 
fourth-generation DLSRs. 


1.3.2 Undulators 


In undulators, the angular deviation of the electrons away from the central axis is of 
the order of 1/y; the radiation cones emitted by the electrons thus overlap as they 
execute the slalom motion. Consequently, radiation from the dipoles interferes with 
one another. As such, the field amplitudes are added vectorially (i.e. including the 
phase difference from each contribution) and the sum of this is squared to produce 
the intensity, which peaks at those wavelengths where interference is constructive. 
Undulators therefore differ fundamentally from bending magnets and wigglers in 
that their spectral flux reflects this interference phenomenon and is hence concen- 
trated in evenly separated, narrow bands of radiation (Fig. 1.13). The first practical 
undulator device to operate in the X-ray regime was constructed by Klaus Halbach 
and co-workers at the Lawrence-Berkeley National Laboratory and tested at the 
SSRL synchrotron at Stanford in 1981. This breakthrough was thanks on the one 
hand to the development of novel magnetic alloys such as SmCos [6], allowing the 
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construction of magnet arrays with the required small periodicities and high mag- 
netic field strengths [7]; and on the other, to a clever arrangement of pole orientations 
(referred to as the *Halbach array’) which effectively suppresses the field strength 
on one side of the array and doubles it on the other, thus maximizing the magnetic 
flux where it is needed. 

The four basic parameters for undulator radiation from a device of length L are the 
relativistic Lorentz factor y, the undulator spatial period A,,, the number of periods in 
the magnet array N = L/A,, and K. As already stated, for an undulator, K is about 
unity. K can be varied by changing the gap size between the upper and lower arrays of 
magnets; this tunes the spectrum so that a suitable near-lying spectral maximum sits 
at the desired photon energy. The transformation from wiggler to undulator radiation 
is achieved in practice not by reducing the lateral excursions through reduction of 
the magnetic field strength between the magnetic pole pairs—this would result in 
an unacceptable drop in flux—but instead by reducing the magnetic pole spatial 
periodicity A,, [see (1.26)]. 

It emerges that the condition for constructive interference is given for the mth 
harmonic by 


Àu K? 
MÀm = 1+ ; (1.28) 


or in practical units 


: 2 
_ 13.056, [em] ( K ) (1.29) 
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The intrinsic source size and divergence of undulators associated exclusively with 
photon emission (i.e. ignoring the electron emittance) are given by 


1 
c? = —VAL (1.30) 
4r 
and 
P — A (1.31) 
- rs . 
resulting in an intrinsic emittance of 
Xr 98.66 
P d] = — = . 1.32 
PIPER pen 


Note that the divergence o’? can also be expressed in terms of the harmonic number m 
and number of periods N, and is approximately equal to o"? = 1/(y4 mN). The 
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Fig. 1.14 Measurements of optimized gap positions for different photon energies and utilized har- 
monics of the U14 undulator of the Materials Science beamline, SLS, from the second to nineteenth 
harmonic. In normal operation, scanning to higher energies within any given harmonic is achieved 
by opening the undulator gap (the progression of this between the third and fifth harmonics is shown 
as the dot-dashed blue lines). One moves from a given harmonic to the next higher (red dot-dashed 
lines) when the desired photon energy can be accessed at the higher harmonic with a gap size no 
smaller than the minimum allowed value, here 4 mm, shown as the blue horizontal dashed line. 
Adapted from [8], with permission (Copyright 2013, IUCr) 


divergence thus becomes smaller with harmonic number and number of periods in 
the undulator. 

The interference spectrum at an angle 0 away from the central axis of the undulator 
is shifted towards lower energies, and is given by 


MAm(0) = E 1+ = +y). (1.33) 
2y? 2 


For example, an observer positioned off axis by half the natural opening angle 0 — 
1/2y (approximately 3.5 mm at a distance of 40 m for a facility with a 3 GeV storage 
ring energy) would, for K — 1, see a spectrum shifted by a factor 7/6 to the red. 

The spectral width of the undulator harmonics is inversely proportional to the 
number of periods, N. As in any interference or diffraction set-up, the condition 
for constructive interference becomes increasingly strict the larger the number of 
involved 'scatterers' (in this instance, the 2N magnet pairs). Hence, the inverse of 
the relative bandwidth, called variously the monochromaticity or the quality factor 
Am/AAm = Vg / AVm, is equal to the number of periods N multiplied by the harmonic 
number m. As an example, the tenth harmonic of an undulator consisting of N = 70 
periods has a relative bandwidth AA,,/A,, of 1.4 x 107. 

The undulator spectrum is tuned by varying K. This is achieved by changing the 
gap between the two sets of magnetic poles and thereby the magnetic field strength Bo 
[see (1.26) and Fig. 1.14]. 

Note that, for reasons of symmetry, even harmonics are in general weaker than odd 
harmonics. Higher K undulators provide both more intense higher-energy harmonics 
than lower K devices, while the difference in intensities between even and odd 
harmonics is less pronounced. 
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1.3.3 Polarization of Synchrotron Radiation 


Polarization of light in spectroscopy is a highly valuable tool, as certain selection 
rules imply that certain electronic features can be distinguished merely by using 
differently polarized X-radiation, for example, in the study of surface absorbates on 
crystalline surfaces, or in distinguishing between differently oriented ferromagnetic 
or antiferromagnetic domains. Magnetic dichroism using tunable and polarized soft 
or hard X-rays at synchrotron sources offers the unique feature of chemical specificity. 
Magnetic features down to below the 100-nm scale can be imaged by exploiting the 
dependence of the absorption on the polarization of X-rays in ferromagnetic and anti- 
ferromagnetic materials. The methods are called X-ray magnetic circular dichroism 
(XMCD) and X-ray magnetic linear dichroism (XMLD), respectively, for ferromag- 
netic and antiferromagnetic materials (see Chap. 4). Imaging down to the nanometre 
range is very important for magnetic structures, as at this scale, the influence of 
domain boundaries between one magnetic direction and another becomes significant 
and new phenomena can occur which would be negligible in larger structures. An 
understanding of the energetics of nanomagnetism is thus essential in the drive to 
further miniaturize magnetic memory storage devices. 

The polarization of undulator radiation can be controlled and varied in so-called 
APPLE (Advanced Planar Polarized Light Emitter) undulators. APPLE undulators 
have undergone successive refinements (reflected by their generational names of 
APPLE-I, II, III and X), based on the flexibility of movements of their magnet 
arrays; here, we consider only the basic principles. 

Instead of the two Halbach magnet arrays found in ‘normal’ undulators discussed 
thus far, APPLE undulators consist of four arrays (two above, two below) that can 
be longitudinally shifted relative to each other [A1—A4, Fig. 1.15a] [9]. Each array 
consists of a periodic repetition of four different orientations [down (blue @); reverse 
(green arrow); up (yellow ©); and forward (red arrow), see Fig. 1.15b]. With all array 
components aligned longitudinally, normal linear horizontal (LH) radiation as found 
in ‘normal’ undulators is produced. Antisymmetric shifts (that is, one array moving 
in the positive z-direction, the other in the negative) of Al and A4 between —2,,/2 
and +A,/2 will maintain linearly polarized radiation but will vary the tilt angle 
from linear vertical (LV at —4,,/2), through LH (zero shift) back to LV (4-A,,/2). 
Conversely, symmetric movements of Al and A4 by approximately +A,,/4 will 
produce right circularly polarized light (at approximately —2,,/4), or left circularly 
polarized light (at 4-4, /4). In this last instance, the exact shift in the arrays depends 
on the gap size and details of the magnet strengths. Symmetric shifts smaller than 
this result in elliptical polarization, that is, radiation with both a linear and circularly 
polarized component. 

Synchrotron sources provide fluxes and brilliances many orders of magnitude 
greater than those from laboratory-based sources. Third-generation facilities were 
defined by their use of insertion devices, most importantly undulators. The fourth 
generation of synchrotrons, just beginning to come online at the time of writing, 
increase brilliances by two orders of magnitude by reducing the horizontal (orbital 
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Fig. 1.15 APPLE undulators. a APPLE undulators consist of four magnet arrays Al, A2, A3 and 
A4. b The magnet periodicity à„ is composed of four magnet orientations. Viewed from above, these 
are down (blue &); reverse horizontal (green arrow); up (yellow ©); and forward horizontal (red 
arrow). When all four arrays are aligned longitudinally, linear horizontal polarization is produced. c 
By shifting arrays Al and A4 either symmetrically or antisymmetrically by +A, /2, linear vertically 
polarized radiation is produced. d Linear polarization of any desired tilt angle can be selected by 
moving A1 and A4 antisymmetrically relative to A2 and A3. e and f Circularly polarized radiation 
is generated by symmetric shifts of Al and A4 by approximately A,,/4, the exact shift depending 
on the gap size and details of the magnetic field strengths. Reproduced from [3] with permission 
(Copyright 2019, John Wiley and Sons) 


plane) emittance through the implementation of MBAs. We discuss this in more 
detail now. 


1.4 Diffraction-Limited Storage Rings 


As we have already intimated, the main limit to the emittance in storage rings is due 
to the electron spread induced at bending magnet achromats as a consequence of 
quantum excitation (Sect. 1.2.6). In our discussion of DBAs in Sect. 1.3.1, the lower 
limit to the electron emittance, its so-called *natural emittance', was given by (1.23). 
However, if we extend an arc section simply by increasing the number of dipoles 
from 2 to M in a so-called MBA, and keeping the swept angle per dipole constant, 
the ratio between the natural emittances of the DBA and MBA is 
€x,DBA M-1 


—3 ; 1.34 
€x, MBA M+1 ( ) 
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Such an MBA, subtending an arc of angle M/2 times larger than that of the DBA, 
will have a superior emittance up to a factor of three for large M. So, for example, a 
10? DBA will have a natural emittance 2.25 times that of a 35? 7BA (M — 7) fora 
fixed swept angle per dipole. 

This is clearly not the way to proceed, as an increase from two bends to M will 
reduce the number of straights around the ring by a factor of M /2. A far more effective 
way to improve the natural emittance of an MBA can be achieved by reducing the 
swept angle 0 per dipole. This is due to the cubic dependence of the former on the 
latter [see (1.23)]. So, if in the example above, the 7BA is designed to sweep the same 
total angle as the DBA, the reduction in emittance will now be (4/9) x (QT? = 
32/3087 7: 1/100, thus producing emittances which assume values of the order of 
100 pmrad. This is only an order of magnitude larger than the diffraction-limited 
photon emittance for hard X-rays of approximately 10 pmrad. Indeed, soft X-ray 
beamlines, for which 4/477 ~ €x mpa, are already diffraction-limited when using 
MBAs. 

Note also that the natural emittance scales with the square of the storage ring 
energy €. The two highest-energy third-generation storage rings, namely the APS 
(7 GeV) and SPring8 (8 GeV) are both planning to decrease € to 6 GeV in their 
upgrades, despite the associated decrease in the highest accessible photon energies. 

In most instances, because an achromat (be it a DBA or MBA) requires about 
the same length of ‘real estate’ within the storage ring, the ring’s circumference C 
is approximately inversely proportional to 0—large rings have bend achromats that 
subtend smaller angles than those at small rings. This is why the large APS facility 
has 36 sectors, while the smaller ALS in Berkeley only has 12. Thus, for a fixed cell 
design, a convenient figure of merit 


e, C? 


M= 2 


(1.35) 


describes how well optimized the magnet lattice is [10], as this takes into account 
the available real estate and electron beam energy and weighs the emittance by this, 
accordingly. 

The newest (fourth) generation synchrotron facilities are thus dubbed ‘diffraction- 
limited storage-rings’ (DLSRs) [10-13]. A light source is said to be diffraction- 
limited if the horizontal electron beam emittance is smaller than that of the radiated 
photon beam (A./47 ). In practical terms, the diffraction-limited photon energy EpL 
with a wavelength equal to 4x €x, is given by 


98.66 


By this metric, even a second-generation facility would be a DLSR for a beam- 
line generating infrared radiation; in contrast, most third-generation facilities are 
diffraction-limited in the very soft X-ray regime at around 25 eV. Note that beam- 
lines that use photon energies that lie near to or below the diffraction limit for the 
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facility where they are installed gain little or nothing from the emittance of the stor- 
age ring being further improved (i.e. reduced by more sophisticated electron optics). 
This is the true meaning of the diffraction limit: the fundamental lower value set by 
the photon beam rather than the electron beam. 

DLSRs are thus defined by their use of MBAs. They gain most at large- 
circumference storage rings, at medium storage ring energies and indeed the two 
greenfield DLSR facilities to date (MA X-IV and Sirius) plus the first major upgrade 
from third to fourth generation (ESRF-EBS) reflect these characteristics. This seem- 
ingly obvious and long-understood approach to improve the horizontal emittance via 
the use of MBAs and thereby the brilliance has only recently been pursued because, 
until now, the costs (both in hardware and real estate) and the introduction of poten- 
tial mechanical misalignments associated with increasing the number of elements in 
the magnet lattice, were considered unacceptable. Miniaturization of these magnet 
lattice components and the development of multifunctional magnets machined from 
a single yoke block have decreased both costs and the necessary circumference of 
the ring to accommodate the MBAs and thus achieve these goals. The PETRA-IV 
project is especially interesting, as the circumference is 2.3 km, allowing exception- 
ally shallow angles for the arcs, promising diffraction-limited photon energies well 
into the hard X-ray regime, at approximately 6 keV. 

Another serious obstacle to reducing the size and separation of the magnets is 
that the lateral dimensions (that is, the cross-section) of the storage ring vacuum 
vessels containing the electron beam need to become so small that pumping them with 
traditional pumping equipment, in particular, ion-getter pumps, becomes increasingly 
difficult. In recent years, however, a novel approach to achieve ultrahigh vacuum 
(UHV) conditions has been developed, namely the use of non-evaporable getter 
(NEG) coating of the inner walls of the storage ring vacuum vessels [14, 15]. NEGs 
are porous alloys of Al, Ti, Fe, V and Zr sintered onto the inner walls of the vacuum 
vessel to a thickness of the order of a micrometre, or even as little as 100-200 nm. 
After installation, the vacuum vessel is pumped to a moderately high vacuum using 
traditional pumps and then heated out to temperatures below 200? C. This activates the 
NEG material, allowing pressures to drop to approximately 107!? mbar in the UHV 
regime. Importantly, even very narrow spaces can be readily coated. Moreover, recent 
developments in computer numerical control of machining storage ring components 
allow micrometre accuracy such that geometrically near-perfect miniature vacuum 
vessels with cross-sections of the order of a square centimetre can be constructed. 

With the advent of DLSRs and improvements in magnetic materials, undulator 
spectra have undergone important transformations. Figure 1.16 compares the bril- 
liance of the same undulator at a third-generation source and at a DLSR having a 
40 times smaller total horizontal emittance. In addition to the expected 40 times 
increase in peak brilliance, the DLSR-spectrum is substantially cleaner: the lobes 
seen on the low-energy flanks of the spectral peaks for the third-generation source 
are completely absent in the DLSR spectrum. This is because the horizontal width 
of the electron beam is much smaller (typically by an order of magnitude). In third- 
generation facilities, the horizontal electron beam width is, at approximately 100 um, 
two orders of magnitude larger than the oscillation amplitude A, which is of the order 
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Fig.1.16 Comparison ofthe brilliance of undulator spectra at third- and fourth-generation facilities. 
Top: the lateral extent of the electron beam passing through an undulator at third-generation facilities 
is approximately two orders of magnitude larger than the oscillation amplitude A, while at DLSRs, 
it might only be approximately 10A. Bottom: as a consequence, an observer on-axis will ‘see’ less 
off-axis radiation, given by (1.33), thereby suppressing the lobes on the low-energy flanks of the 
main spectral maxima. Note also the enhanced brilliance at the spectral peaks for the DLSR. Both 
simulated spectra were generated for a U12 undulator (that is, à, = 12 mm) containing 120 magnet 
periods, for K — 1.6, 400 mA and a storage ring energy of 2.4 GeV. Courtesy Marco Calvi, Paul 
Scherrer Institute. Reproduced from [3] with permission (Copyright 2019, John Wiley and Sons) 


of a micrometre. This means that an observer on axis at the DLSR-undulator will 
see a much smaller contribution from emission from off-axis electrons, and it is 
these that produce the low-energy lobes in the spectra, according to (1.33). Many 
experiments do not require the relative BWs of the order of 10~* provided by crystal 
monochromators and would profit from using the entire flux from any given har- 
monic. The relative BW of the mth undulator harmonic is 1/mN, where N is the 
number of undulator periods. For the lower harmonics, this is of the order of 0.005— 
0.01; for higher harmonics, the relative BW does not drop as steeply as 1/m, as 
gradually, the off-axis contributions do begin to leak into the peaks, causing them 
to broaden marginally. Nonetheless, the spectral quality remains sufficiently high 
to use the entire flux of any given harmonic for small-period undulators at DLSRs. 
Note, however, spectral filtering is still required to remove the other harmonics. This 
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can be effectively achieved using multilayer monochromators or, in some instances, 
refractive optics such as compound refractive lenses or prisms [16]. 

Improvements don't stop here, however. A critical aspect of undulator design is 
that the magnetic field must be exceedingly homogeneous in the x-direction (that 
is, in the horizontal plane, perpendicular to the undulator axis) in the region where 
the electron beam propagates. This has meant that, for third-generation facilities, the 
magnet yokes need to be a few centimetres wide or more in the x-direction. The 
approximately five to ten times smaller extent of electron beams passing through 
undulators installed in DLSRs means that the yokes can be of the order of one 
centimetre. This allows the design of compact magnetic ‘funnels’, concentrating 
magnetic field lines and thus increasing the magnetic field strength, which, in turn, 
allows for a more compact design and shorter undulator periods. Moreover, this 
means that more periods (N) can be fit into a given undulator length L. 

The reduction of the horizontal electron emittance in DLSRs thus lends many 
potential scientific opportunities and will drive exciting innovations along the full 
technology chain, from the sources (in particular, undulators), through improved X- 
ray optics (regarding minimization of optical imperfections and aberrations), detector 
technology, to handling of large data volumes. The first DLSR, MAX-IV in Lund, 
Sweden, came online in Summer 2016, while a second greenfield facility. Sirius in 
Campinas, Brazil and the ESRF, the first facility to undergo an upgrade from third 
generation to DLSR status, both began pilot experiments in early 2020. 

Several orders of magnitude higher peak brilliance are provided by high-gain 
X-ray free-electron lasers (XFELs). The machine science and technologies for this 
paradigm shift in X-ray sources are based on fundamentally different principles, and 
consequently, XFELs are discussed separately in the next section. 


15 X-Ray Free-Electron Lasers 


Radiation from fourth-generation DLSRs has some important common features with 
laser radiation: it is very intense, collimated and, in the case of radiation from undula- 
tors, partially monochromatic. An important distinction, however, is that the degree 
of transverse (spatial) coherence of synchrotron radiation, although much improved 
at DLSRs compared to that produced by third-generation facilities, is still only of the 
order of a few percent in the hard X-ray regime. This is because, although radiation 
from any single electron travelling along an undulator is coherent, there is no spatial 
(i.e. phase) correlation between different electrons, and hence their combined output 
remains largely incoherent. In contrast, visible lasers are normally close to being 
100% coherent. 

Moreover, the shortest pulse duration of X-rays from synchrotrons is a few tens 
of picoseconds and may be as large as a few hundred picoseconds in the case of 
DLSRs. Lasers in the visible and near-visible regimes can have pulse lengths as small 
as 70 attoseconds in some exceptional cases; more representatively, pulse lengths of 
a few femtoseconds are routinely achieved. The production of femtosecond light 
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pulses has been motivated by studies of the dynamics of chemical reactions—this 
is one of the most important goals in science, as a biomolecular understanding of 
the processes of life, plus most industrial chemical processes, depend intimately 
on understanding transient intermediate states. Indeed, the holy grail of physical 
chemistry is the direct observation of the making and breaking of chemical bonds 
and following reactions through the reconfiguration of the atomic structure. In order 
to achieve this, one needs a probe that can spatially resolve the atomic positions, 
measured in angstroms, thus necessitating the use of hard X-rays. Moreover, in order 
to record a movie of chemical processes, each ‘frame’ must be shorter than the speed 
of the process divided by the desired resolution, which we have just stated should 
be of the order of an angstrom. Typical velocities associated with atomic motion 
are of the order of 1000 ms~', hence each movie frame should be separated by 
no more than approximately 100 fs. For example, phonons and molecular rotations 
have time scales of approximately 100—1000 fs, while molecular vibrations, crucial 
to the process of chemical reactions, are shown to have periodicities of the order of 
10-100 fs. 

X-ray free-electron lasers (XFELs) were developed to fulfil this goal. They provide 
extremely intense and short pulses of X-rays. XFELs and synchrotron facilities differ 
substantially in this respect (see Table 1.1). The total average number of photons 
per second delivered to a third- or fourth-generation synchrotron beamline after 
monochromatization (typically between 10!? and 10!^ s7!) is comparable to that of 
an XFEL beamline, although the latter can vary by orders of magnitude, depending 
on the facility repetition rate and whether the beam is monochromatized or not. In 
contrast, their time structures are very different: synchrotrons deliver pulses with 
the same frequency as the RF supply, typically several hundred million pulses per 
second (1/ At); each pulse is maybe a few tens of picosecond long (AT) and contains 
the order of 107-10? photons. XFELs deliver anything between approximately 100 
and several hundreds of thousands of pulses per second, depending on the electron 
gun source and the properties of the accelerating LINAC. Most importantly, each 
XFEL pulse is only a few tens of femtoseconds long (and can be even shorter, 
down to a femtosecond) and contains the order of 10? photons. The peak arrival 
rate of photons at XFELs is thus approximately 10 billion times higher than that at 
synchrotron beamlines. 

XFELs thus provide the tools needed to study the dynamics, physical properties 
and structure of materials with unsurpassed spatial detail and on a time scale over a 
thousand times shorter than that which is possible at synchrotron facilities. 

The beam quality in storage rings is limited by the stochastic competing processes 
described above in radiation equilibrium. High-gain XFELs are made possible by 
a runaway process called self-amplified spontaneous emission (SASE). Because, in 
contrast to synchrotrons, the electrons in XFELs are not stored and require only a 
few microseconds to traverse the length of the entire XFEL facility, they are far less 
perturbed than are the equilibrated electrons in a storage ring, meaning they can 
maintain a very low emittance defined by the electron gun, acceleration and bunch 
compression mechanism. 
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Table 1.1 Comparison of orders of magnitude synchrotron and XFEL radiation! properties. ' XFEL 
values derive from LCLS unless otherwise stated. 8-keV photons assumed. *EuroXFEL time struc- 
ture: 2700 pulses at 4.5 MHz, 10 such bursts per second. ** photons s~! 0.1% BW-!. * After Si(111) 
monochromator, Av/v = 1.4 x 107^. TUnmonochromatized, full SASE spectrum. 323 kW during 
pulse burst 


Property Synchrotron XFEL 

AT 50 — 400 ps 1 — 100 fs 

At 5 ns 107? -2 x 1077 s* 
Average flux** 2 x 10!4 1014 

Peak flux** 6 x 1055 2x 105 

#hv/pulse 4 x 10* 5 4 x 102 

Peak power 1w? 10!! wil 

Average power 25 mW* 600 mw! — 140 Wits 


The linear architecture of XFELs is very different than that of synchrotron facil- 
ities; it is summarized in the following. 


1.5.1 XFEL Architecture 


The fate of electrons in a high-gain XFEL proceeds as follows. In order to generate 
pulses of electrons, a ps laser is focussed on to the surface of a photoemitter. The 
resulting ps-duration electron pulse contains a charge of the order of 300 pC (or 
2 x 10° electrons) and is accelerated in a first LINAC to energies of the order of 
several hundred MeV (see Fig. 1.17). Shorter laser pulse durations containing the 
same total number of electrons are excluded due to Coulomb repulsion, which would 
blow up the beam size and thus spoil the emittance. However, once the electrons 
become relativistic after acceleration in the first, short, LINAC, the bunch can be 
compressed without spoiling the emittance. This is achieved in one or more bunch 
compressors; thereafter, the electrons are further accelerated in a longer LINAC, 
before entering a very long undulator array. It is while the electrons are travelling 
along the undulator that the SASE process takes place, resulting in the emission of 
femtosecond-duration pulses of X-rays. 

The initial few-ps-duration electron bunch produced by the low-emittance gun 
has a transient peak current of about 50 A (300 pC/6 ps). Bunch compression is 
required in order to reduce the pulse duration to approximately 300 fs and produce 
the several thousand amperes peak current required to induce SASE (Fig. 1.18). 

The necessary factor of approximately 20-50 increase is realized by adjusting 
the RF phase in the first accelerator module (LINAC 1 in Fig. 1.17) to allow the 
electrons to ‘surf’ down the slope of the sinusoidal RF field—the leading electrons 
experience a slightly smaller acceleration by the sinusoidal electric field of the RF 
cavity and are thus accelerated less than those electrons towards the back of the 
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Fig.1.17 Schematic of XFEL architecture. Electron bunches are emitted from a low-emittance gun 
(LEG) irradiated by picosecond laser pulses. They are then accelerated in a short LINAC (LINAC 1), 
then compressed using one or more bunch compressor magnet chicanes (BC); they are then further 
accelerated using a much longer LINAC (LINAC 2) before entering a long undulator, typically a 
few hundred metres in length. The SASE process along the undulator produces extremely intense 
X-ray pulses with durations of the order of 50 fs. Reproduced from [3] with permission (Copyright 
2019, John Wiley and Sons) 
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Fig. 1.18 Bunch compression in high-gain XFELs. Before entering the first LINAC, the electron 
density ne and spread in kinetic energy A E are both relatively low. The phase of the LINAC RF-field 
relative to the passage of the bunch is so adjusted to induce a larger spread of the electrons’ kinetic 
energies, whereby the bunch's trailing edge is made to be more energetic than the leading edge. 
Compression is achieved by allowing the bunch to pass through a four-dipole chicane, where the 
faster electrons at the back can catch up with the slower electrons at the front, thanks to the shorter 
path that they execute. Reproduced from [3] with permission (Copyright 2019, John Wiley and 
Sons) 


bunch (a phenomenon referred to as ‘chirping’). However, the electrons’ velocities 
are already so close to the speed of light that any differences are far too small to allow 
the faster electrons at the back of the bunch to catch up with the slower electrons 
positioned further forward and thereby squeeze the bunch length. Instead, the bunch 
passes through a magnetic four-dipole chicane. The trailing (high-energy) electrons 
execute a shorter path through the chicane because they are less deviated by the 
chicane's magnetic field. This shorter path means that they catch up with the less 
energetic leading electrons, which are more deviated by the magnet chicane. This 
compression thus shortens the bunch duration to approximately 300 fs (equating to a 


30 P. R. Willmott 


bunch length of 100 um) and increases n, by a factor of approximately 20-50. The 
increased energy spread of the chirped electron beam after bunch compression must 
be removed through 'dechirping', which can be achieved in one of several ways [18, 
19]. 


1.5.2 The SASE Process 


As the electrons begin to propagate down the undulator, they initially emit X-rays 
independently and stochastically and are bathed in this radiation. They will interact 
with this EM field. We begin our discussion of SASE by determining the magnitude 
of these forces in conventional undulators at synchrotrons. 

Classical electromagnetism tells us that the power transmitted per unit area by an 
EM plane wave is given by 


P Ey Ho 


A 2 "' 


(1.37) 


where A is the cross-section of the beam, Eo is the amplitude of the electric field 
component and Hp is the amplitude of the magnetic intensity, which, in vacuum, is 
related to the magnetic field strength amplitude Bo by 


B 
Hy = — 


= : (1.38) 
Ho 


where uo = 4z x 1077 mkgC^? is the permeability of free space (given here in SI 
units). Also from classical electromagnetic theory, it emerges that 


E 
Bye (1.39) 
C 


From (1.37) to (1.39), we thus obtain an areal power density 


P E$ 


A = 2uoc ` 


(1.40) 


A typical synchrotron undulator may have a source size of A = 100 x 10 wm”, pro- 
duce light bunches of 50-ps duration, each containing 5 x 10° 1-À photons (within 
the full width of a harmonic). This equates to an areal power density of the order 
of 1.6 x 10!! Wm. From (1.39) and (1.40), one calculates an electric field ampli- 
tude Ey ~ 107 V m-! and Bo ~ 40 mT. This latter value is nearly two orders of 
magnitude smaller than that imposed by the undulator's magnet array. We can thus 
conclude that, in a conventional third- or fourth-generation synchrotron facility, the 
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forces of the electromagnetic field generated by undulator radiation on the electrons’ 
trajectory are negligible. 

In the case of XFELs, however, the beam is tailored to have as low an emittance 
and as high an electron density as possible, through the low-emittance gun and bunch 
compression, respectively. The motivation for this is exactly to induce forces through 
the generated radiation that are sufficiently large that they do indeed have an impact 
on the energy and spatial distribution of the electron bunch. Although at the upstream 
end of the XFEL undulator, this interaction is still very weak, it is sufficient to seed 
the runaway process of SASE, which is now briefly described. 

The ratio of the Lorentz force and electric field force experienced by an electron 
bathed in EM radiation is 


cup = —Fpß . (1.41) 
C 


The electrons within XFELs are highly relativistic and hence to a high degree of 
accuracy, 6 ~ 1 and FT ~ — Fg (the fractional imbalance in the forces is equal to 
AF/Fg = 1/2y? ~ 2 x 107?). Thus, for an electron moving exactly along the axis 
of the EM radiation (that is, ignoring for the time being the oscillations induced 
by the magnet array of an undulator), the electric and Lorentz forces are equal and 
opposite and point perpendicularly to the beam propagation. 

If the electrons move in a straight line parallel to the EM plane wave, both the 
electric and magnetic forces (which anyway cancel each other out) always act at 
right angles to the electrons' direction of propagation and thus could transfer energy 
to or from the electrons. What happens if we now allow the electrons to move in 
directions that are not precisely parallel to the EM radiation, such as in the slalom path 
induced by an undulator's magnet array? Let us now consider those positions exactly 
in between the magnet poles, where the electrons have their maximal transverse 
velocity. 

The first observation to make is that, because the magnetic field of the emitted EM 
radiation always lies perpendicular to the plane of the electrons’ trajectory, it can- 
not transfer energy, but only vary the direction of the electrons’ motion. In contrast, 
the electric field part of the emitted radiation has a component parallel to the elec- 
trons' trajectory and can either decelerate them or accelerate them. Consequently, 
some electrons will be accelerated while others are decelerated. These two opposing 
interactions cause the electrons to form microbunches within the ‘normal’ bunch, 
separated by a distance equal to the wavelength of the light they both generate and 
are bathed in. Although, for hard X-rays, the ‘normal’ bunch length of approxi- 
mately 100 um equates to approximately 10° microbunches, SASE only begins in 
the coincidentally most intense portion of the bunch. As such, SASE is a stochastic 
process. The duration of the SASE radiation therefore depends on the degree of bunch 
compression and the integrated charge of the initial bunch as it enters the undulator 
array. Low-charge bunches will thus produce XFEL pulses with lower peak brilliance 
but with durations that can be shorter than 1 fs. More commonly, 'standard' bunch 
durations are a few tens of femtoseconds. 
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Fig. 1.19 Incoherent and coherent radiation from an ensemble (bunch) of electrons. If the relative 
longitudinal positions of the electrons bear no correlation to the radiation they emit, the relative 
phases of that radiation are likewise random. From normal statistics, it can be shown that their 
summed amplitude is equal to the amplitude of the radiation emitted from one electron multiplied 
by the square root of the number of electrons (ne) in the bunch. The intensity (proportional to the 
square of the amplitude) is therefore proportional to ne. In contrast, the radiation induced by SASE 
is fully coherent, as the emission from all electrons is in phase. The amplitudes of the radiation from 
each of the ne electrons thus add linearly, resulting in an increase in intensity by n2. Reproduced 
from [3] with permission (Copyright 2019, John Wiley and Sons) 


SASE is a runaway effect. The more the electrons bunch together into 
microbunches, the stronger is the radiation they produce, leading to stronger associ- 
ated forces on those electrons and still tighter microbunching. 

An important property of the microbunches produced by SASE is the following. 
In the case of normal SR, the emission of radiation from n, electrons within a 
bunch is stochastic and uncorrelated, and hence the vector addition of the individual 
amplitudes will follow normal statistics and, on average, be equal to Ai = /neA, 
where A is the amplitude of the contribution to the radiation of any one electron. 
But the intensity is proportional to the square of the amplitude and is therefore 
proportional to ne. 

In the case of XFEL SASE radiation, the initial bunch is modified so that it 
becomes a set of microbunches separated from their neighbours by the wavelength 
of light produced by the undulator interference. It follows that the individual contri- 
butions to the radiation amplitude will all be more or less in phase and the electrons 
will all emit coherently. Thus, the amplitudes add linearly as they all have the same 
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phase and hence Atot = ne A. The intensity will be proportional to n? (see Fig. 1.19). 
This is the key to the ‘runaway’ amplification in SASE: the more the electrons bunch 
and radiate in phase (that is, coherently), the stronger is the EM-field interaction 
with them. This in turn further enhances the microbunching phenomenon associated 
with coherent emission, thus increasing the degree of coherent emission in a positive 
feedback loop. 

This runaway instability causes the light intensity to grow exponentially along the 
undulator until the process saturates [20]. The microbunches produced by the SASE 
process each contain approximately n, — 10? electrons, resulting in a peak brilliance 
for XFELs approximately one billion times larger than that from fourth-generation 
synchrotron sources! 

The quality of a high-gain XFEL is encapsulated in the dimensionless so-called 
Pierce, or FEL, parameter, ppg, given approximately by 

ron K? Ws 

DFEL — ES , (1.42) 

where A, is the undulator period length, ro = 2.82 x 10-P m is the Thomson scat- 

tering length (also known as the classical electron radius), N, is the electron density 

of the pulse bunch as it enters the undulator, K is the deviation parameter, and 

y = €[m,c? is the Lorentz factor. Equation (1.42) can be re-expressed in practical 
units as 


1.55 x 102? 
E[GeV] 


1/3 


PrEL = [A [mm]? N,[mm ?]K?] (1.43) 


Electron densities in the compressed electron bunch entering the undulator are 
typically 10^ j.m^? (that is, 10° electrons in a volume of the order of 10? j.m?); the 
undulator period à, = 15-30 mm, and K-values of approximately 2-3 are normal. 
The LINACS produce electron energies of the order of 10 GeV. Because of the weak 
cube-root dependency of prz; on all the design variables except E, its spread is fairly 
narrow across all facilities designed or commissioned to date, and normally assumes 
values of around 5 x 107^. 

preL determines three important properties of high-gain XFEL radiation. Firstly, 
it describes the fraction of the electrons' power converted at SASE saturation to 
photon power, that is 


Pon 
P, ` 


PFEL = (1.44) 


The larger is prgr, the more efficiently electron energy is converted into photon 
energy. preL also describes the relative spectral BW at saturation, 


Aw 
PrEL = — , (1.45) 
w 
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which means that at photon energies of the order of 10 keV, the instantaneous BW 
(that is, the BW of any one FEL pulse) is of the order of 5 eV. This is over an order of 
magnitude larger than the transmitted BW after monochromatization with a diamond 
(004) single crystal. In addition, the stochastic nature of the initial spontaneous 
generation of SASE radiation is responsible for the central position of the SASE 
spectrum jittering from pulse to pulse by a few tens of eV. This is acceptable for 
many types of experiment, such as in serial femtosecond crystallography, but can 
be problematic in others, especially in spectroscopy. The bandwidth can be reduced 
in several ways including self-seeding, or the use of chicanes in so-called high- 
brightness SASE. It lies outside the remit of this overview to detail these here [3]. 

In contrast to the transverse coherence length, the longitudinal coherence length 
of SASE pulses [which depends inversely on the BW, see (1.14)] is small, of the 
order of 200 nm in the hard X-ray regime. 

Thirdly, the distance Lg within an XFEL undulator required to obtain a gain in 
SASE radiation by a factor e ~ 2.72 is inversely proportional to prey and is given 
by 


hu 


bue. 
Az A/3 PrEL 


(1.46) 


Inserting our known values for 4,, and org; , we obtain values for Lg of the order of 
a few metres. Saturation of SASE occurs after a gain of approximately five orders of 
magnitude. But 10? ~ e!!, and hence the total undulator length should exceed Lg 
by a factor of 11 or more. The LCLS hard X-ray undulator is over 110 m long. 

Hard XFEL radiation can therefore directly track atomic motions in condensed 
matter and vibrations with periods of the order of picoseconds down to tens of 
femtoseconds, a hitherto inconceivable scientific endeavour. XFELs deliver peak 
brilliances many orders of magnitude greater than radiation from third- and fourth- 
generation synchrotron sources, 100% transverse coherent radiation, and pulse dura- 
tions typically of a few tens of femtoseconds, but which can also be tailored to be 
less than one femtosecond [21, 22]. 


1.5.3 Concluding Remarks 


The main objective of this section was to summarize the machine physics of XFELs 
in a manner that is accessible to a wide spectrum of potential users. XFEL tech- 
nology and science are bound to further develop and morph in the next decades, 
as, in contrast to synchrotron science and technology, XFELs are still very much in 
their infancy. Nonetheless, they are maturing rapidly, thanks in no small part to the 
knowledge base already established for synchrotron facilities. Because the radiation 
produced by XFELs differs so greatly from SR in the peak brilliance and associated 
very short pulse durations, experimental methods commonly used at synchrotrons 
need more often than not to be entirely rethought in order to be carried out at XFELs; 
indeed, some experimental methods at synchrotrons are wholly excluded at XFELs. 
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On the other hand, some experimental techniques first developed at XFELs, in par- 
ticular serial femtosecond crystallography, are now being adopted and adapted for 
synchrotron light. XFELs should thus not be considered as superior alternatives to 
synchrotrons, but more as complementary facilities. 

Many experimental disciplines, particularly in the fields fs-time resolved studies 
and nonlinear light-matter interactions, have emerged using XFELs and are still 
rapidly evolving; this is the reason, in fact, why XFELs were developed in the first 
place. It will be intriguing to see how these develop and expand in the forthcoming 
years and decades. The future of photon science indeed looks bright! 


1.6 Summary 


This chapter has introduced the reader to the basic physics of radiation produced 
by accelerated electrons and the recent advances in machine technologies which 
have resulted in the development of DLSRs and XFELs. The number of dedicated 
X-ray sources has burgeoned over the last three decades and has opened the field of 
synchrotron science, transforming it into a multidisciplinary enterprise, nowadays 
attracting some hundred thousand scientists across the broadest spectrum of the 
natural sciences. 
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Chapter 2 A) 
Concepts in Magnetism get 


Stephen J. Blundell 


Abstract I review some general concepts in magnetism including the nature of 
magnetic exchange (direct, indirect and superexchange), and how exchange interac- 
tions play out in multiple spin systems. The nature of atomic orbitals and the way 
in which they interact with the spin system is also considered. Several examples are 
also treated, including the Jahn- Teller interaction and its role in the properties in 
layered manganites. 


2. Introduction 


Magnetic properties are found in a wide variety of materials. In order to explain mag- 
netism we need to consider a range of different behaviours in many different types 
of magnetic system. Consider the following: Fe and Ni are both metallic elements 
and exhibit ferromagnetism; MnO is an insulating oxide with a three-dimensional 
antiferromagnetic structure; La;CuO, is a layered material which exhibits antifer- 
romagnetism but, when doped, becomes superconducting; some compounds do not 
order magnetically but show frustrated effects with an abundance of slow dynamics; 
some molecules become single-molecule magnets in which the individual molecules 
show quantum tunnelling of magnetization and a range of other interesting properties. 
Theories of magnetism have to explain all these materials and more. 

For a start, we must realize that a classical approach will not work. The Bohr-van 
Leeuwen theorem [1] states that in a classical system there is no thermal equilib- 
rium magnetization. We can prove this in outline as follows: in classical statistical 
mechanics the partition function Z for N particles, each with charge q, is propor- 
tional to 


J [o f vleet ol dri ---dry dp,---dpy , (2.1) 
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where 6 = 1/kgT, kg is the Boltzmann constant, T is the temperature and i = 
1,..., N. Here E({r;, p;}) is the energy associated with the N charged particles 
having positions rj, 72,..., ry, and momenta p,, p». ..., py. The integral is, there- 
fore, over a 6N -dimensional phase space (3N position coordinates, 3N momentum 
coordinates). The effect of a magnetic field is to shift the momentum of each particle 
by an amount qA. We must, therefore, replace p; by p; — qA. The limits of the 
momentum integrals go from —oo to +00 so this shift can be absorbed by shifting 
the origin of the momentum integrations. Hence the partition function is not a func- 
tion of magnetic field, and so neither is the free energy F = —kgT log Z. Thus the 
magnetic moment m = —(0 F/0B)r must be zero in a classical system. 

Thus, we need quantum mechanics to make further progress. In this chapter, I 
will not provide an exhaustive review of magnetism (fuller treatments can be found 
elsewhere, e.g. [2—5]) but focus on a few key issues and some selected examples. 
To begin our discussion, it is helpful to note the energy scales inherent in magnetic 
problems. First, there is the kinetic energy which is on the eV scale. This typically 
takes a value like /?z?2/(2mL?), where L is a length scale and this expression is 
the familiar one for particle in a box. This is an energy cost and arises because it 
takes energy to put an electron in a small box. Second comes the potential energy 
which is also on a similar scale and takes a form such as e? / (4x «o L). This will be 
a negative energy if considering the attraction between an electron and a nucleus 
(and becomes larger and more negative as L decreases) and positive if considering 
electron-electron repulsion. Atoms are the size they are because of a compromise 
between kinetic energy wanting the atom to be infinite size and the potential energy 
wanting the atom to be zero size. Because one energy goes as L7? and the other as 
—L- a compromise can be reached (and this is essentially the derivation of the Bohr 
radius). Both the kinetic and potential energies are large and are typically >> kpT. 
Next, we have to add the spin-orbit interaction which is typically much smaller, 
usually in the meV, and the magnetocrystalline anisotropy, which in cubic materials 
is in the weV. These effects will turn out to be very important in magnetic materials, 
but they are small perturbations to the main interactions and will mainly come into 
play only once the magnetic order is established by the dominant interactions. 


2.0 Exchange 


The exchange interaction arises from the kinetic and potential energy in bonds 
between atoms. To see how this comes about, we begin by recalling simple results 
for the molecular orbitals in Hz [see Fig.2.1a]. We label the two hydrogen atoms A 
and B and write the wave function |W) as a linear combination of atomic orbitals 
| VA) and |Vg) so that 

Iv) =calWa) + calVrs) - (2.2) 
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Fig. 2.1 a Two hydrogen (a) lva) lwp) (b) 

atoms A and B can lower o* 
their energy by forming a 

hydrogen molecule H2. 

b The bonding and 


antibonding molecular 


orbitals o and o* o 


The Hamiltonian can be written as a sum of the kinetic energy and two terms for the 
potential energy due to the attraction to each hydrogen nucleus so that 


5 h 
H = -— V? + Va + Vp. (2.3) 
2m 


We then need to solve the equation Hw) = E|w). The diagonal integral Eo, which 
can be approximated by the binding energy of the electron at one of the centres for 
a hydrogen atom, is given by 


Eo = (WalHlWa) . (2.4) 


The transfer integral (also known as the hopping integral or resonance integral) ft is 
given by . 
t = (VD Vs) . (2.5) 


In the simplest approximation (the Hückel approximation), the overlap integrals are 
given by 5;; = (vVi|V;) = à;; and hence the secular equation | H;; — ES;;| = 0 can 
be written as 


=0, (2.6) 


Eo — E t 
t Eo — E 


and hence 
E = Eott. (2.7) 


The eigenfunctions for these solutions are the symmetric solution 


+ 
o) = A) + lv) dies 
J2 
which costs energy Eo — t and the antisymmetric solution 
ies IVa) — |Vs) (2.9) 


V2 


which costs energy Eo + t. These are known as the bonding and antibonding states, 
respectively [see Fig. 2.1b]. The hydrogen molecule has two electrons so the ø level 


42 S. J. Blundell 


is full and the o* level is empty, thus saving the energy overall and leading to the H2 
molecule being a stable entity. The molecule He» does not form because it has four 
electrons and would, therefore, involve filling both o and o* and thus saves no energy 
(and in fact, outside the Hückel approximation, it turns out that o* — Eo > Ey — o 
and so helium bonding costs more energy than it saves). 


2.2.1 Direct Exchange 


Exchange interactions are nothing more than a consequence of electrostatic inter- 
actions and the familiar interplay between potential energy and kinetic energy that 
we see in chemical bonds. Consider a simple model with just two electrons which 
have spatial coordinates rı and r2, respectively. The wave function for the joint state 
can be written as a product of single electron states, so that if the first electron is in 
state V; (r1) and the second electron is in state v; (r2), then the joint wave function 
is Wa(r1)Wo(r2). However, this product state does not obey exchange symmetry, 
since if we exchange the two electrons we get v; (r2) V, (r1), which is not a multiple 
of what we started with. Therefore, the only states which we are allowed to make 
are symmetrized or antisymmetrized product states which behave properly under the 
operation of particle exchange. 

For electrons, the overall wave function must be antisymmetric so the spin part 
of the wave function must either be an antisymmetric singlet state xs (S — 0) in the 
case of a symmetric spatial state or a symmetric triplet state xz (S = 1) in the case 
of an antisymmetric spatial state. Therefore, we can write the wave function for the 
singlet case Ws and the triplet case Wr as 


1 
Ws = altar vera) + Walr2)Wo(r1)] xs 


1 
Y= Jal simon) — Walr2) Wor) xr . (2.10) 


where both the spatial and spin parts of the wave function are included. The energies 
of the two possible states are 


Es = [srs dr; dr? 
ET = EL dr; dr? E 


with the assumption that the spin parts of the wave function xs and xr are normalized. 
The difference between the two energies is 


Es — Er = af War VE 2) HWalr2) Wor) dri dra . (2.11) 
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For a singlet state $1. S; = -i while for a triplet state Sı > $2 = i Hence the 
Hamiltonian can be written in the form of an ‘effective Hamiltonian’ 


~ 1 
H= 4 (Es + 3Er) — (Es — Er)S1 - $5. (2.12) 


This is the sum of a constant term and a term which depends on spin. The constant can 
be absorbed into other constant energy terms, but the second term is more interesting. 
The exchange constant J is defined by 


Es — ET E " ^ 
pee f Vs mn) G2) Hrs G2) Wor) dr dr? , (2.13) 


and hence the spin-dependent term in the effective Hamiltonian can be written as 
Ferm = —2J S1 - S2 . (2.14) 


If J > 0, Es > Er and the triplet state S = 1 is favoured. If J < 0, Es < Er and 
the singlet state S = 0 is favoured. Thus, the exchange interaction compares two 
different configurations that are tied to the singlet and triplet spin states, but the 
energy difference associated with exchange comes from the difference in those two 
configurations worked out from an integral [see (2.13)] over the spatial coordinates. 
Thus the spins are really there just to label the two different spatial states and are 
inextricably tied to the spatial wave functions by the Pauli principle; the exchange 
interaction is really between spatial wave functions, even though we tend to think 
about it as between the spin parts that really just come along for the ride! 
Equation (2.14) is relatively simple to derive for two electrons, but generalizing to 
a many-body system is far from trivial. It motivates the Hamiltonian of the Heisenberg 
model: . 
H--» s.s. (2.15) 


ij 


where J;; is the exchange constant between the ith and jth spins. The factor of 2 is 
omitted because the summation includes each pair of spins twice. Another way of 
writing (2.15) is 

fi 2-29 4S: -S;, (2.16) 


i>j 


where the i > j avoids the ‘double-counting’ and hence the factor of two returns. It 
is worth noting that there are different conventions for the definition of J that are in 
use in the literature. I call these the J-convention and the 2 /-convention and they are 
summarized in Fig.2.2. Note that it is also possible to choose the sign of J so that 
J > 0 means ferromagnetic (as here) or antiferromagnetic. Both choices are found 
in the literature. 
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Fig. 2.2 The two different 
conventions used for the 


definition of J. In this “J convention” “2J convention” 
chapter (and in [2]), we are 
using the 2J convention (so J 2J 


that 2J is the energy 
associated with a single 
pairwise interaction between 
two spins). The various 
alternative expressions that 
one can use for the many spins: many spins: 
Heisenberg interaction are —2 y J45S S; 
; iu. t0 7] 
shown under the heading 1j 
‘many spins’, as well as an e JijSi i Sj 
ij i 
=J 2 S i" S j 
(ij) 


2 spins: 2 spins: 
—JS1 -S2 —2JS1 -S2 


expression for a 
one-dimensional (1D) chain 
of spins 


—-JxS;-Si4 (1D) -2J £ Si; Si 4 (1D) 
1 1 


2.2.2 Indirect Exchange 


Ifthe electrons on neighbouring magnetic atoms interact via an exchange interaction, 
this is known as direct exchange. This is because the exchange interaction proceeds 
directly without the need for an intermediary, and this was considered in the previous 
section. 

Very often direct exchange cannot be an important mechanism in controlling the 
magnetic properties because there is insufficient direct overlap between neighbouring 
magnetic orbitals. For example, in rare earths the 4 f electrons are strongly localized 
and lie very close to the nucleus, with little probability density extending significantly 
further than about a tenth of the interatomic spacing. This means that the direct 
exchange interaction is unlikely to be very effective in rare earths. Even in transition 
metals, such as Fe, Co and Ni, where the 3d orbitals extend further from the nucleus, 
it is extremely difficult to justify why direct exchange should lead to the observed 
magnetic properties. These materials are metals which means that the role of the 
conduction electrons should not be neglected, and a correct description needs to be 
taken into account of both the localized and band character of the electrons. 

In metals the exchange interaction between magnetic ions can be mediated by the 
conduction electrons. A localized magnetic moment spin-polarizes the conduction 
electrons and this polarization in turn couples to a neighbouring localized magnetic 
moment a distance r away. The exchange interaction is thus indirect because it does 
not involve direct coupling between magnetic moments. It is known as the RKKY 
interaction (or also as itinerant exchange). The name RKKY is used because of the 
initial letters of the surnames of the discoverers of the effect, Ruderman, Kittel, 
Kasuya and Yosida [6—8]. The coupling takes the form of an r-dependent exchange 
interaction Jgkky (r) given by 
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Fig. 2.3 The real space susceptibility of a free electron gas is given by 
x(r) = 2k Moe eg (ER) (2kpr)/7 where F(x) = (—x cos x + sin x)/x* is the function illus- 
trated. A localized spin in a free electron gas, therefore, gives rise to an effective exchange 
JgRKkKY x F(2kpr) and this is proportional to cos(2kgr)/ r? when r > kg l 


cos(2kpr) 
Jrxxy(r) x mE (2.17) 


at large r (assuming a spherical Fermi surface of radius kp). The interaction is long 
range and has an oscillatory dependence on the distance between the magnetic 
moments (see Fig.2.3). Hence depending on the separation it may be either fer- 
romagnetic or antiferromagnetic. The coupling is oscillatory with wavelength 7 / kp 
because of the sharpness of the Fermi surface. 


2.2.8 Superexchange 


A number of ionic solids, including some oxides and fluorides, have magnetic ground 
states. For example, MnO [see Fig. 2.4a] and MnF> are both antiferromagnets, though 
this observation appears at first sight rather because there is no direct overlap between 
the electrons on Mn?* ions in each system. The exchange interaction is normally 
very short ranged so that the longer ranged interaction that is operating in this case 
must be in some sense ‘super’ (think of Superman leaping over buildings, a skill not 
afforded to ordinary mortals). 

The origin of superexchange is the possibility of mixing in excited states to lower 
the energy. The favouring of antiferromagnetic superexchange in a linear Mn-O-Mn 
bond arises from the fact that the excited states are allowed, while for the ferromag- 
netic arrangement these excited states are forbidden [see Fig. 2.4b]. One can consider 
this problem with a toy model based on a Hubbard-style Hamiltonian (see, e.g. [9]) 
which may be written as 


A = -t X 8,6 UM Aint . (2.18) 


(ijjo 
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Fig.2.4 a The crystal structure of MnO. Nearest neighbour pairs of Mn?* (manganese) ions are 
connected via O?- (oxygen) ions. b A simple model of superexchange for a Mn-O-Mn bond. 
I: the antiferromagnetic ground state with opposite spins on the two Mn ions and a pair of electrons 
on the oxygen anion. II and III: two excited states of the antiferromagnetic ground state in which 
the electrons from (T) hop back and forth. IV: the competing ferromagnetic ground state. This is 
energetically more costly because the excited states analogous to (II) and (IID are not available 
because of the Pauli exclusion principle 


where the first sum is over nearest neighbours; thus energy is lowered by hopping 
(the first term on the right) but there is an energy penalty for double occupancy (the 
second term on the right) due to the Coulomb repulsion energy U. Let us now restrict 
this model to a system with two possible sites for electrons (here we are ignoring the 
intermediate oxygen to for simplicity). We can start by putting a single electron with 
spin ^ into the system. Using a basis | 7, 0) and |0, 7), the Hamiltonian is given by 


^ 0 —t 
H= e 0 ) ; (2.19) 


because with one electron there is no possibility of a Coulomb penalty, and so the 
only energy to worry about is the energy saving you get from hopping. This is the 
same as the H5 problem we considered earlier and the eigenvalues are +? and so the 
lowest energy state is the bonding state, just as before. 

Now let us put a second electron into the system with opposite spin to the first. 
Now using a basis such that a general state can be written as 


I) =al 4,0) E bI. D E cl T) + 410.13) . (2.20) 


and we can easily show that in this basis the Hubbard Hamiltonian is 


Ut-t 0 
^ too t 
Ld TT ML (2.21) 


O0 t — U 
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where the minus signs appear because of the exchange symmetry. The eigenvalues 
are 0, U and (U/2) + y (U/2)? + 21?, so in the limit that t/U «& 1 the last pair 
of eigenvalues are U + 2t7/U + O(t*^/U?) and —212/U + O(t*/U?). Thus, the 
ground state has energy —21?/ U . If we try the same problem again with two electrons 
with the same spin then they cannot sit on the same site because of the Pauli exclusion 
principle. Thus the only state possible is | ^, +) and this has energy E = 0. Thus, 
there is an energy saving in having the two electrons with opposite spin because you 
can go lower than E = 0 and have E = —21?/U + O (t^/ U?). This means that the 
exchange interaction has a magnitude J ~ 27?/U. The moral of the story is that by 
having the possibility to mix in the higher energy states in which two spins sit on the 
same site (costing U), itis possible to lower the overall energy. The antiferromagnetic 
arrangement allows this process to happen; the ferromagnetic arrangement forbids it. 
Superexchange can be considered in more detail [10] and can in certain circumstances 
be ferromagnetic. The size and sign of the superexchange interaction is codified in 
the Goodenough-Kanamori-Anderson rules [11—14]. 


2.3 Consequences of the Heisenberg Exchange Interaction 


We have seen that at the heart of the exchange interaction is a term S, -$ b, a simple 
scalar product between two spin operators. If that scalar product is expanded, we 
have 


where the raising and lowering operators $+ and $~ are defined by 


St 2 $* i$" 
$74 -i1. (2.23) 


Although the term $: S in (2.22) seems to be simple enough to handle, the term 
(St a + 5 St ) will give rise to flip-flop processes in which simultaneously an 
up-spin labelled a is lowered and a down-spin labelled b is raised, or vice versa. This 
part of the interaction has profound effects. 


2.3.1 Two Interacting Spin-1 Particles 


In this section, we will consider two spin-5 particles coupled by a scalar interaction 
described by a Hamiltonian H = AS, . Sp, where Si and $ p are the operators for the 


a tot a a 
spins for the two particles. We can also write the total spin operator S$ = Sa + S; 
so that 
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Table 2.1 The eigenstates of Ša - Sp fora two-spin system and the corresponding values of ms, s 
and the eigenvalue of S, - Sp 


Eigenstate m; sS Sa- Sp 
I1 l l 7 
FY) 2d UD 1 
0 1 1 
NZ 4 
n -1 1 i 
FY) mE Yt) 3 
0 0 3 
J2 4 
a tot > A2 A2 A A 
(S) = (Sa) + (Sb) -28,. So. (2.24) 


In quantum mechanics, when you combine the angular momentum of two spin- 
i particles you have the 'addition law' that 5 + 5 = 0, 1. You can think of this 
simply as arising from the fact that you can combine the two moments together 
constructively or destructively. Alternatively, imagine adding two classical vectors J, 
and J2 together but varying the angle between them. In that case, the resulting vector 
Jı + J2 would have length ranging from | J; — J2| to J; + J2 (where J; = | J; | and 
Jy = | J2|). More formally, combining the representations of two spin-4 together 
yields a representation 


DO @D® 2 pO e pO , (2.25) 


In other words, the result of combining two spin- 7 particles is a combined object 


with spin quantum number s = 0 or 1. The eigenvalue of (S^ is s(s + 1) which 
is therefore either O or 2 for the cases of s = 0 or 1. The eigenvalues of both (S) 
and ($5)? are i. Hence from (2.24) 


e e _ftzif s=1 
$$ - ftii uiae (2.26) 


The system, therefore, has two energy levels for s — 0 and 1 with energies given by 


+4 if s=1 
E= MP aa (2.27) 
5 =0. 


The degeneracy of each state is 2s + 1, so that the s = 0 state is a singlet (a single 
energy level) and the s = | state is a triplet (three energy levels). The z component of 
the spin of this state, m,, can only equal 0 for the singlet, but can be —1, 0, or 1 for the 
triplet. Thus the product of two spin-j representations, which have a dimensionality 
of 2 x 2 — 4, gives rise to states s — O (singlet) and s — 1 (triplet), which have a 
total dimensionality of 1 + 3 = 4. 
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We have considered the eigenvalues of Si . S, but what about the eigenstates? The 
most straightforward basis to consider is 


Itt), IM), Wt. 44). (2.28) 


The first arrow refers to the z component of the spin labelled a and the second arrow 
refers to the z component of the spin labelled b. The eigenstates of Sa -$ p are linear 
combinations of these basis states and are listed in Table 2.1. The value of m, is equal 
to the sum of the z components of the individual spins. Since the eigenstates are a 
mixture of states in the original basis, we cannot know both the z components of the 
original spins and the total spin of the resultant entity. This is a general feature which 
will become more important in more complicated situations. 

The basis in (2.28) also fails to satisfy the condition that the overall wave function 
must be antisymmetric with respect to exchange of the two electrons. Since the wave 
function is a product of a spatial function pace (r1, r2) and the spin function x, 
the spatial wave function can be either symmetric or antisymmetric with respect to 
exchange of electrons. For example, the spatial wave function 


$(ri)&(ra) x ó(rz)&(ri) 
J2 


is symmetric (4-) or antisymmetric (—) with respect to exchange of electrons depend- 
ing on the +. This type of symmetry is known as exchange symmetry. In (2.29), 
$ (r;) and &(r;) are single-particle wave functions for the ith electron. Whatever is 
the exchange symmetry of the spatial wave function, the spin-wave function x must 
have the opposite exchange symmetry. Hence x must be antisymmetric when the 
spatial wave function is symmetric and vice versa. This is in order that the product 
Vspace (1, F2) X x is antisymmetric overall. 

States such as | t+) and | ||) are clearly symmetric under exchange of electrons, 
but exchanging the two electrons in | ^ |) yields | JT) which is not a multiple of 
| 4). Thus | f 4), and also by an identical argument | |, 1), are both neither symmetric 
nor antisymmetric under exchange of the two electrons. The true eigenstates must, 
therefore, be linear combinations of these two states (see Table 2.1). The state (| 7 |.) + 
| t))//2 is symmetric under exchange of electrons (in common with the other two 
s = 1 states) while the state (| 4) — | t))/2 (the s = 0 state) is antisymmetric 
under exchange of electrons. 

The energy levels are shown in Fig.2.5. Without the flip-flop term TO 5 + 


bs 3 ) in 2.22), the Hamiltonian is simply Å= AS: (S p and this leads to two degen- 
erate doublets as shown. The upper doublet (consistiag of the states | ^ 1) and | | 4)) 
is unchanged when the flip-flop terms are switched on. The lower doublet (consist- 
ing of the states | ^J) and ||‘), although strictly speaking it should of course be 
the symmetric and antisymmetric combinations of these two states) splits with the 
addition of the flip-flop terms to make H = AS, - Š, and the symmetric combination 


V'space (11, r2) = (2.29) 
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Fig. 2.5 The energy levels for the two Hamiltonians: Å= ASE $i and H = AS, G $o 


rises up in energy to A/4 and the antisymmetric combination is lowered to —3A/4, 
becoming the ground state when A > 0. 


2.3.2 A Chain of Spins 


Let us now consider not just two spins but a one-dimensional ferromagnetic chain of 
spin-5 moments described by the Heisenberg model. In one dimension, the Hamil- 
tonian for the Heisenberg model can be written as 


A AA LEA A E 
A= -u Y 8 56555 $5820] (2.30) 


where J > 0. The ground state ® consists of all spins aligned [see Fig. 2.6a] and this 
is an eigenstate of H so that H|®) = —NS?J|®). Now to create an excitation, let 
us flip a spin at site j, so let us now consider a state 


lj) = 5; |®) (2.31) 
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Fig. 2.6 a The ground state of a one-dimensional ferromagnet in the Heisenberg model is |®). An 
excited state | j} has a single flipped spin at site j. The spin-wave state |q} is a delocalized spin flip. 
b The dispersion relation for the spin waves 


which is simply the ground state with the spin at site j flipped [see Fig.2.6a]. By 
flipping a spin, we have changed the total spin of the system by i —í(— 5) = |. This 
excitation, therefore, has integer spin and is a boson. If we apply the Hamiltonian to 
this new state, we get 


Ali) -2[CNS*J + 2SJ)|j) - SJlj*- 2) - Sj - 0]. (2.32) 


which is not a constant multiplied by |), so this state is not an eigenstate of the 
Hamiltonian. Nevertheless, we can diagonalize the Hamiltonian by looking for plane 
wave solutions of the form 

iqR 


1 
= - iy. 2.33 
lq) = JN 2 ej) (2.33) 


The state |g) is essentially a flipped spin delocalized (smeared out) across all the 
sites [see Fig. 2.6a] and is known as a spin wave or a magnon. The state |q) is also 
an eigenstate of an operator exchanging any two spins, which is not the case for |). 
Since |q} is a linear combination of states like | j} which represent a single flipped 
spin, the total spin in the z-direction of |q} itself has the value N.S — 1. It is then 
straightforward to show that 


^ 


^q) = E(q)lq) . (2.34) 


where 
E(q) — —2NS?J + 4JS(1 — cos qa) . (2.35) 


The energy of the excitation is then w = 4J S(1 — cos qa) and is plotted in Fig. 2.6b. 
At small q, ñw ~ 2J Sq?a?, so that w « q?. In three dimensions, the density of states 
is given by g(q) dq « q? dq, which leads to 
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g(w)dw « w! dw (2.36) 


at low temperature where only small q and small w are important. The spin waves 
are quantized in the same way as lattice waves. The latter are termed phonons, and 
so in the same way the former are termed magnons. They are bosons and have a spin 
of one. 

The number of magnon modes excited at temperature T , nmagnon, is calculated by 
integrating the magnon density of states over all frequencies after multiplying by the 
Bose factor, [exp(fio/ kg T) — 1]^!, which must be included because magnons are 
bosons. Thus the result is given by 


n = f "Eda (2.37) 
m^ Jo expÜio/kgT) — 1’ 


which can be evaluated using the substitution x = hw/kgT. At low temperature, 


where g(c) c c? in three dimensions, this yields the result 
kpT 3/2 poo y 1/2 dy 3 

magnon = | 7 - T n . 2.38 

"mag ( h ) Í e*—] x Pn 


Since each magnon mode which is thermally excited reduces the total magnetization 
by one (because each magnon mode is a delocalized single reversed spin), then at low 
temperature the reduction in the spontaneous magnetization from the T = 0 value is 
given by 

MI —M T32. (2.39) 

M(0) 

This result is known as the Bloch T?" law. If one repeats this calculation in two 
dimensions (rather than three), the integral diverges, showing that magnons spon- 
taneously form at all non-zero temperatures, thereby destroying any magnetization. 
The impossibility of spontaneous magnetization in two dimensions for the Heisen- 
berg model is known as the Mermin-Wagner theorem [15-17] (see also [9]). 


2.3.3 Three Spins 


Let us return now to an apparently simpler system and consider three spins on the 
corners of an equilateral triangle. We will put the exchange interaction to be negative 
and thus the system is frustrated. If we put the first spin up, the next one down, 
then we have a dilemma of how to arrange the third one because we cannot satisfy 
the antiferromagnetic interactions on every bond. The solution has to be one of 
compromise and in fact the ground state of the classical Heisenberg model on a 
triangle is the so-called 120° state shown in Fig. 2.7a. 
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(a) (b) 


Fig.2.7 The ground state of the classical Heisenberg model is the 120? state shown in (a), though 
a version with opposite chirality (b) is also possible. If each of the three spins are rotated by a 
constant angle [(c) and (d)] then additional ground state configurations can be obtained 


In fact, there are some other possible solutions since we can choose to wind the 
spins round the triangle in two different ways. The configuration in Fig. 2.7b also has 
a 120? angle between adjacent spins but has the opposite chirality to that of Fig. 2.7a. 
Moreover, the Heisenberg model only cares about the relative angle between spins, 
not their absolute orientation, and therefore the configurations in Fig.2.7c and d are 
also part of the ground state manifold. 

Let us now solve the problem quantum mechanically. The law of addition of 
angular momentum now gives 1 + 5 + i = 1 1 3. Now three two-dimensional 
representations (for three spin-+) have a dimensionality of 2? = 8 which is equal 
to two two-dimensional representation and a four-dimensional representation (for 
two spin-4 and a single spin-3, so 2? = 2 + 2 + 4). Another way of writing this 
combination is 

D? @ D? @ D? 2D? @ D® | (2.40) 


For three spins we have that 


^t 


S = $i + $ + $3 (2.41) 


and hence 
atot o ^2 ^2 ^2 ^ ^ 
($ yz$ 28,128,429: $,. 8j, (2.42) 
(is) 


and so using the facts that the eigenvalue of (S^ is S'* (S'** + 1) and the eigenvalue 
a2 
of S; is iG +1)= i we have 


€ ^ 1 tot y qtot 3 
BEXELI S^ (S six]. (2.43) 
(i,j) 


We have two cases: (i) S'^* = 3 implies that a Si- $; = 3; (ii) SS! = 5 implies 
that wa Si- Sj = -i. The energy levels are drawn in Fig.2.8 and consist of two 
degenerate doublets at E = —3A/4 (S = D and a quartet at E — 3A/4. 
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Fig. 2.8 The energy levels E 
for a triangle of spins consist A 
of two degenerate doublets quartet 
and a quartet 5 
3A E (0) (0) 
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two doublets 
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It is interesting to consider the 8 states that make up these energy levels [18]. We 
can write these as follows 


Ios) = 1444) 
Wc 
Rape en PCS i 
es 
1 2 
y epee 2rijk/3 çi 
Lo Bo SITA) 
Iac) = IW) (2.44) 


Two states are obvious. These are the ‘ferromagnetic’ configurations | ^77) and 
|444). The other six contributions have a single spin-flip with respect to these 'fer- 
romagnetic configurations’ and so are made up of states like | | ++). However, a state 
like | | 1 1) is not exchange symmetric or antisymmetric, so you have to make linear 
combinations such as | L4 1) + |141) +1114). This is exactly what is achieved by 
the sums in (2.44) (and note that the 1/ /3 factor is simply a normalization). The 
operators C ; are threefold rotations of order j, and j, k = 0, 1, 2. 

What is more, we can recover the chiral nature noted in the classical solutions. If 
we define a chirality operator Ĉ, given by 


^ 1 
C; = a . (S5 x $3) E (2.45) 
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then our states are eigenstates of the chirality operator 


Ĉ.|Yy-3) =0 

CP )- 

CV.) = Wye) 

C&JwO ,j 2 —[ 1). (2.46) 


Note that the quantum numbers k = 0, 1, 2, and chirality C; = 0, +1 describe the 
same states [18]. The states with non-zero chirality are in the pair of doublet ground 
states (for A > 0) while the excited state quartet has zero chirality. 


2.4 Orbitals 


2.4.1 Transition Metal Ions 


The Heisenberg model only depends on the relative orientation of spins, not on their 
absolute orientation. It, therefore, seems to take no account of the lattice in which 
spins are embedded. However, spins are ‘aware’ of the lattice via the spin-orbit 
interaction. We now turn to consider the electronic orbitals that may be occupied 
in real systems. We will here particularly focus on transition metal compounds in 
which localized moments occur. (For metallic systems a band-like description would 
be more appropriate.) 

Commonly occurring first-row transition metal ions are shown in Fig. 2.9 (taken 
from [19]), illustrating the range in occupancy of the 3d shell that can be obtained 
using transition metal ions in different oxidation states. I also show their electronic 
configuration for the case in which the ion experiences an octahedral crystal field 
(which splits the ten d-levels into a sixfold f2, level and a fourfold e, level [2]). 
Octahedral environments are very common in a wide variety of compound. This 
figure also shows that some ions are very particular about their oxidation state; for 
energetic reasons, Sc!!! and Zn" are the only stable states. On the other hand, an ion 
such as Mn can have a very large range of oxidation states from Mn" (in MnO) to 
Mn"! (in potassium permanganate KMnO4). In fact, with clever chemistry even Mn! 
is possible [20]! This range of occupancy leads to different spin and orbital moments 
of the ion and hence can be used to control the magnetic properties of the resulting 
system. For ions with d^, d?, d and d" configuration, there is the possibility of both 
low-spin and high-spin states. 
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Fig. 2.9 Electronic states of the d-electrons in octahedrally coordinated first-row transition metal 
ions in commonly found oxidation states. The d-shell contains 10 electrons and so configurations 
d9, d!, ..., d? are possible, indicated by the vertical lines. Depending on the oxidation state 
chosen (and not all possibilities are chemically available) these configurations can be achieved 
using different first-row transition metal ions (Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Cu, Zn). The oxidation 
state is indicated by the roman superscript, so that Cu" is copper in its doubly ionized oxidation state 
(i.e. Cu?*). An octahedral crystal field splits the ten d-levels into a sixfold 155 level and a fourfold 
eg level and the electron configurations are shown schematically at the bottom of the figure together 
with their description using the conventional spectroscopic notation (' A' signifies an orbitally non- 
degenerate state, ‘E’ doubly degenerate and ‘T’ triply degenerate). For ions with d^, d’, dÓ and d? 
configuration, there is the possibility of low-spin and high-spin configurations (Figure from [19]) 


2 Concepts in Magnetism 57 


2.4.20 Spin—Orbit Interaction and Crystal Fields 


Now that we have the possibility of some partially filled d-levels, there can be mag- 
netism. Next let us turn to the spin—orbit interaction which has the form 


^ 


AS E. (2.47) 


so that the $ operator acts on the spin part of the wave function, while the L operator 
acts on the spatial part of the wave function. If the states are approximately atomic 
states, and the spin-orbit interaction acts as a perturbation, one can focus on the 
A.S* L* part and note that 


E= -ih— , (2.48) 
which has eigenfunctions ei" i.e. 
f zaimý : ð imọ imọ 
Le"? = —ih—e™? = mie” , (2.49) 
ap 


Now the crystal field is a real potential which is due to electrostatic fields from 
neighbouring ions. The eigenfunctions of the crystal field cannot be proportional to 
e”? because we require real solutions. (Recall the problem of particle in a box where 
the solutions are real and take the form cos kx or sin kx, but not e*, but of course 
we can make real functions by making linear combinations such as e! + e^! * and 
making a wave function out of something proportional to that.) To make a crystal 
field state we must, therefore, look for linear combinations of eigenfunctions such 
as 

|y) = ein? 4 ecimó | (2.50) 


This kind of state though will automatically have zero angular momentum along the 
z-direction because it is made up of an equal contribution of a state with (L*) = mh 
and (L^) = —mh. In fact, this idea works for all directions and 


(L) 20. (2.51) 


For example, if / — 1, there are three states with m — 1,0, —1 with wave func- 
tions given by the spherical harmonics Y;,, (0, $), and are therefore proportional to 
sin 0 e?, cos 0 and sin 0 e~'®, respectively. We could write these states as |1), |0) and 
| — 1). However, these are not the famous p-orbitals familiar from chemistry books. 
These arise in the formation of chemical bonds due to (real) electrostatic effects and 
must, therefore, be linear combinations of exactly the kind we are talking about. 
Thus the p-orbitals that line up along the x-, y- and z-directions [see Fig.2.10a] are 
the zero-angular-momentum linear combinations given below: 
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Ory dzz dyz 


Fig. 2.10 Angular representation of a p-orbitals and b d-orbitals 


m [D + | — 1) 
l J2 
|Py) = et ad Sa 
A2i 
|Pz) —0). (2.52) 


What we see here is the phenomenon of quenching of angular momentum and it is 
interesting that it shows up even in the unfamiliar setting of the “balloon animals’ 
of p-orbitals. Of course in magnetism we are usually more interested in considering 
d-orbitals or f-orbitals but the same principles hold. For example, the equivalent 
equations for the d-orbitals [see Fig. 2.10b] are 


do) = AS 
dea) P 2E -2) 
peers 
a= a 1) 
|dz2}) = |0} . (2.53) 


2.4.3 Jahn-Teller Effect 


It can sometimes be energetically favourable for, say, an octahedron to spontaneously 
distort as shown in Fig.2.11 because the energy cost of increased elastic energy is 
balanced by a resultant electronic energy saving due to the distortion. This phe- 
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Fig. 2.11 The Jahn-Teller 
effect for Mn?* (34^). An 
octahedral complex (left) can 
distort (right), thus splitting 
the t2 and e, levels. The 
distortion lowers the energy 
because the singly occupied 
eg level is lowered in energy. 
The saving in energy from 
the lowering of the dyz and 
dyz levels is exactly balanced 
by the raising of the dyy level 


nomenon is known as the Jahn—Teller effect [21]. The distortion lowers the overall 
energy by breaking an orbital degeneracy. For example, Mn?* ions (which have a 
configuration 3d*) in an octahedral environment show this kind of behaviour (see 
Fig. 2.11) because the distortion can break the orbital degeneracy in the e, levels. In 
contrast, Mn^* ions (3d?) would not show this effect because there is no net lowering 
of the electronic energy by a distortion. 

To describe the effect, at least at the phenomenological level, we will assume that 
the distortion of the system can be quantified by a parameter Q, which denotes the 
distance of distortion along an appropriate normal mode coordinate. This gives rise 
to an energy cost which is quadratic in Q and can be written as 


zz 2 N2 
E(Q) = ;Mo*Q*, Q.54) 


where M and o are, respectively, the mass of the anion and the angular frequency 
corresponding to the particular normal mode. Clearly the minimum distortion energy 
is zero and is obtained when Q = 0 (no distortion). 

The distortion also raises the energy of certain orbitals while lowering the energy 
of others. If all orbitals are either completely full or completely empty, this does not 
matter since the overall energy is simply given by (2.54). However, in the cases of 
partially filled orbitals this effect can be highly significant since the system can have a 
net reduction in total energy. The electronic energy dependence on Q could be rather 
complicated, but one can write it as a Taylor series in Q and provided the distortion 
is small it is legitimate to keep only the term linear in Q. Let us, therefore, suppose 
that the energy of a given orbital has a term either +A Q or — A Q corresponding to a 
raising or a lowering of the electronic energy, where A is a suitable constant, assumed 
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to be positive. Then the total energy E (Q) is given by the sum of the electronic energy 
and the elastic energy 


E(Q) = +40 + 5 Mo? Q? f (2.55) 


where the two possible choices of the sign of the A Q term give rise to two separate 
curves. If we consider only one of them we can find the minimum energy for that 
orbital using 0 E/9 Q = 0 which yields a value of Q given by 


A 
= — 2.56 
Qo Mo? (2.56) 
and a minimum energy which is given by Emin = —A?/2Mo? which is less than 


zero. If only that orbital is full, then the system can make a net energy saving by 
spontaneously distorting. 

LaMnO; contains Mn in the Mn?* state which is a Jahn-Teller ion. LaMnOs 
shows A-type antiferromagnetic ordering. If a fraction x of the trivalent La*+ ions 
are replaced by divalent Sr?* , Ca?* or Ba?* ions, holes are introduced on the Mn sites. 
This results in a fraction 1 — x of the Mn ions remaining as Mn?* (34^, 229) and 
a fraction x becoming Mn** (34^, 15,60). When x — 0.2 the Jahn- Teller distortion 

vanishes and the system becomes ferromagnetic with a Curie temperature (Tc) 
around room temperature. Above 7c, the material is insulating and non magnetic, 
but below 7c, it is metallic and ferromagnetic. Particularly near 7c, the material 
shows an extremely large magnetoresistive effect which has been called colossal 
magnetoresistance. 

The situation is actually more complicated because the carriers interact with 
phonons because of the Jahn- Teller effect. The strong electron-phonon coupling 
in these systems implies that the carriers are actually polarons above Tc, i.e. elec- 
trons accompanied by a large lattice distortion. These polarons are magnetic and are 
self-trapped in the lattice. The transition to the magnetic state can be regarded as 
an unbinding of the trapped polarons. There are other signatures of the electron— 
phonon couplings, including magnetic-field dependent structural transitions and 
charge ordering. 


2.5 Conclusion 


This chapter has discussed a number of important concepts in magnetism. Clearly this 
has just scratched the surface and for more details the reader should look elsewhere 
[2—5]. Nevertheless, these principles are helpful in understanding the wide variety of 
magnetic materials that are being studied, from frustrated magnets [22] to molecular 
magnets [19, 23] and from permanent magnetic materials [24] to spintronics [25]. 
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Chapter 3 A) 
Electronic Structure Theory for X-Ray get 
Absorption and Photoemission 

Spectroscopy 


Peter Krüger 


Abstract The principles of X-ray absorption and photoemission spectroscopy cal- 
culations are introduced and the basics of electronic structure theory, including the 
Hartree-Fock approximation, density functional theory, its time-dependent version 
and quasiparticle theory are reviewed on an elementary level. Emphasis is put on 
polarization effects and the role played by electron correlation. 


3.1 Introduction 


In this chapter, the basic theory of X-ray absorption spectroscopy (XAS) and pho- 
toemission spectroscopy (PES) is introduced and popular computational methods 
are reviewed. Since XAS and PES mainly probe electronic excitations, a thorough 
understanding of electronic structure theory is mandatory. We shall review the stan- 
dard theoretical methods for ground state electronic structure calculations, namely, 
Hartree—Fock (HF) and density functional theory (DFT). Among the various excited 
state theories, we focus on time-dependent DFT and briefly touch upon Green's 
function quasiparticle methods and the Bethe-Salpeter equation approach. We do 
not discuss ligand-field atomic multiplet theory, because this important method for 
transition metal L-edge calculations is covered in Chap. 4. 


3.2 Light-Matter Interaction 


As light is an electromagnetic wave, it interacts with all charged particles. In the 
visible to X-ray regime, the interaction with the electrons hugely dominates the 
interaction with the atomic nuclei. We shall therefore disregard the nuclear degrees 
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of freedom in the following, focusing on the electronic state of the system. When 
an X-ray photon impinges on an atom it can be either absorbed or scattered, energy, 
momentum and spin being, of course, conserved in the whole process. In absorption, 
the photon vanishes: all its energy is transferred to an electron which is excited to 
an empty state above the vacuum level. In scattering, the energy of the photon can 
remain the same [elastic (Thomson) scattering]; it can also be partly transferred to the 
atom (inelastic scattering) as in Compton scattering, which leads to the ejection of an 
electron, or in a Raman-like scattering, in which the energy lost by the photon brings 
the atoms in an excited state, without any ionization. Neglecting relativistic effects 
and treating the X-ray field classically, the light-electron interaction is obtained 
by replacing, in the electronic Hamiltonian, the electron momentum operator p by 
p — eA/c, where A(r, t) is the vector potential of the light [1]. In Coulomb gauge 
(V -A = 0), the interaction Hamiltonian then becomes 


e 


A’. 3.1 
2mc? Bt) 


e 
Ha = ——A- p+ 
mc 


The first-order term in A describes light absorption and stimulated emission while 
the second-order term is responsible for (non-resonant) light scattering. Here we 
focus on the absorption process and neglect the generally much weaker A? term. 
First-order perturbation theory (Fermi golden rule) leads to the following expression 
for the absorption intensity from an initial state |i) of energy £;: 


Io) « 9 ^ lA - pli) (o — Ey + E) , (3.2) 
f 


where the sum runs over all possible final states | f) with energy E y. It is common to 
make the dipole approximation, i.e. to neglect the spatial variation of the X-ray field 
A (r). We may also replace the transition operator A - p, by e- r, where e = A/|A| 
is the light polarization vector and the change from p to r is possible by exploiting 
commutation relations between r, p and H, and the fact that |i) and | f) are energy 
eigenstates [2]. Equation (3.2) is often interpreted in a single-particle picture, in 
which case |i) is an atomic core state and | f) are unoccupied states above the Fermi 
level. However, electrons interact with each other through the Coulomb interaction, 
such that the excitation of one electron affects the motion of the others. Therefore, 
the correct use of (3.2) is in a many-particle sense, where |i) = |®,) is the many- 
electron ground state, and | f) = |o ;) are many-electron excited states with a core 
hole. Putting the constants we have 


I (o) = An^aho Y (ple: Y rj], 5o — Ey + Ey), (3.3) 
f j 


where œ = e? /ħc is the fine structure constant and j counts the electrons. Having 
established the expressions of the absorption intensity, the remaining task is to calcu- 
late the eigenstates of the (unperturbed) electronic system, |®,) and |® +), and their 
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energies. Thus, the main theoretical problem of XAS is the accurate description of 
the electronic structure of the system, both for the ground and core-excited states. 
We, therefore, start by reviewing the basics of (ground state) electronic structure 
theory before turning to the specific methods for handling core-excited states. 


3.3 Ground State Electronic Structure Theory 


Consider N electrons interacting with each other and the atomic nuclei. Following 
the Born-Oppenheimer approximation, we neglect the coupling between the nuclear 
and electronic dynamics. For the electronic problem, this means that the nuclei are 
at fixed positions and can be described by a static external potential V4, (r). The 
electronic Hamiltonian is then given by 


H=T + Voc + Ve =) PY Var) Y , (3.4) 

i i i<j 
where i, j count the electrons and atomic units are used (ñ = m = e = 1). The 
kinetic energy T and the external potential V.,, are one-particle operators whereas 
the electron-electron interaction Vee is a two-particle operator. Because of Vse, the 
electronic motion is correlated and the many-electron problem cannot be solved 
exactly (except for a few electrons). Drastic approximations need to be made. The 
most important ground state electronic structure methods are HF and DFT. 


3.3.1 Hartree-Fock Approximation 


Historically, the first accurate electronic structure method is the Hartree—Fock 
approximation (HFA) [3]. It is still widely used for single molecule calculations 
and as a starting point for more advanced schemes. The basic assumption of the HFA 
is that the many-electron ground state wave function is a Slater determinant, i.e. an 
antisymmetrized product of single-electron states (spin-orbitals). By applying the 
Rayleigh-Ritz variational principle, the HF equations are obtained, whose solutions 
are the HF orbitals ¢,, (r) and energies e,. For convenience, we suppress the spin part 
of the single-particle wave functions. The HF equations are 


(-;v + Vas (r) + Va(r) + vx) $,(r) = €,0,(r) . (3.5) 


This is a one-electron Schródinger equation where the pair-wise electron-electron 
interaction is replaced by an effective potential Vy + Vx. 
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/32 / 
Va(r) = 2 fo dr er = =] art (3.6) 


is called the Hartree potential and corresponds to the classical electrostatic potential 
due to the electronic charge density n(r) = 5» |, (r)|? of the occupied orbitals. 


same spin 


Gan D» I ap’ Pmt Pm 06, G7) (3.7) 


Ir — r'| 


is the exchange potential which is due to the electron-electron interaction together 
with the antisymmetry of Slater determinants under permutation of two electrons. 
Vx is a non-local potential and has no classical analogue. Both Vy and Vy are 
static *mean-field' potentials, obtained from the time-averaged orbital motion of the 
electrons. Dynamical effects are neglected. The exchange interaction induces some 
correlation between electrons of same spin, which avoid each other due to the Pauli 
principle. Correlation between electrons of opposite spin is completely absent in the 
HFA. By definition, the difference between the exact ground state and the HF ground 
state is called the electron correlation effect (even though mathematically speaking, 
there is correlation between same spin electrons in the HFA). 

There are various methods to take account of electron correlation, often termed 
collectively as ‘post-HF’ methods in the chemical literature. The conceptually most 
simple way to include electron correlation is the configuration interaction (CI) 
method. In CI, a set of Slater determinants is generated from the HF ground state 
by (multiple) particle-hole excitations. The CI wave function is a linear combina- 
tion of these many-electron basis states and the coefficients and total energy levels 
are determined variationally by diagonalizing the Hamiltonian in this sub-space. CI 
can be very accurate for atoms and small molecules, but cannot directly be applied 
to large molecules and materials because the number of Slater determinants grows 
exponentially with system size. For X-ray absorption spectra, CI effects, i.e. mixing 
between Slater determinants, are especially strong at transition metal L-edges and 
lanthanide M -edges, which correspond to excitations into the localized 3d- and 4 f - 
orbitals. For these spectra, CI must be taken into account. This can be done with the 
ligand-field multiplet method which is based on CI of a single atom or a very small 
cluster (see Chap. 4 for details). 


3.3.2 Density Functional Theory 


Nowadays, most electronic structure methods are based on DFT [4, 5]. In DFT, one 
does not try to find approximations to the many-electron wave function. Instead, 
the idea is to directly find the exact electronic density n(r) and total energy, which 
is expressed as a functional of the density. DFT is based on two theorems due to 
Hohenberg and Kohn [4] about the (non-degenerate) ground state of the interacting, 
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inhomogeneous electron gas. The first theorem states that the external potential Vext 
is uniquely determined by the ground state electronic density n (r) and that the total 
energy E (minus the external potential energy) is a unique and universal functional 
of n.! The consequence of the theorem is that knowledge of the ground state density 
alone is, in principle, sufficient to determine all properties of the system. The second 
theorem states that the exact ground state density no(r) minimizes the total energy 
functional E[n] in the space of all possible functions n(r). Thus, approximations to 
E and n can be found variationally. 

The Hohenberg- Kohn theorems are exact mathematical theorems. If the universal 
functional E[n] were known, DFT would yield the exact total energy and electron 
density of the interacting electron system. But the exact functional E [n] is unknown. 
Various approximate functionals have been proposed such as the local density approx- 
imation (LDA), generalized gradient approximations (GGA) and hybrid functionals, 
i.e. mixtures of GGA and HF exchange. In practical DFT calculations, Kohn-Sham 
theory is employed, which introduces an auxiliary, non-interacting system which, 
by definition, has the same electronic density as the real, interacting system. In the 
auxiliary system, the external potential is called Kohn-Sham potential Vs and it is 
the sum of the true external (nuclei) potential and an effective one-electron potential 
which replaces the electron—electron interaction. The Kohn-Sham potential Vxg is 
given by the functional derivative of the total energy functional E [n] with respect to 
the electron density. The effective electron-electron potential is written as the sum 
of the Hartree potential Vy and a rest, which is called exchange-correlation potential 
Vxc. As the exact energy functional E[n] is unknown, so is Vxc, and the actual 
expression depends on the approximation used (LDA, GGA, hybrid). 

As the Kohn-Sham auxiliary system is non-interacting, its eigenstates are Slater 
determinants made of orbitals ¢, that are solutions of the Kohn-Sham equations 


1 
(-;v + Vea(r) + Vau) + vse) $,(r) = Enr (r) . (3.8) 


The Kohn-Sham equations (3.8) are similar to the HF equations (3.5) except that the 
exchange potential Vx is replaced by the exchange-correlation potential Vyc and 
the expression of the total energy as a function of the orbitals is different. 

DFT takes account of electron correlation through Vyc and generally performs 
better than the HFA for ground state properties. This is, however, not necessarily 
true for excited states for which DFT should, in principle, not be used, because the 
Kohn-Sham orbitals and levels e, describe the auxiliary system and have, strictly 
speaking, no direct physical meaning for the real system. In practice, however, the 
orbitals and energy levels are used in the same way as the HF orbitals, namely, as a 
first-order approximation for the one-electron or one-hole excitations of the system. 


lThe opposite is obvious because when Vext is fixed, the Hamiltonian is known and so all properties, 
including the electronic density, are determined. 
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3.4 Absorption Spectra in the Independent Particle 
Approximation 


Recalling (3.3), the absorption intensity is determined by the transition amplitude 
Mfg = (Pele: $^, r,|o,). If both ®, and o; are Slater determinants made of 
orbitals which are eigenstates of the same one-electron Hamiltonian, then it is easy 
to see that M f, reduces to a one-particle transition matrix element between the core 
orbital |°) with energy e. and an unoccupied orbital |¢,) with energy e; and (3.3) 
simplifies to 


I (o) =4r°aho $ ^ le rl ) 80 — er + €) . (3.9) 
k 


This is the basic equation of XAS in the independent particle approximation. 

So far we have implicitly assumed that $; are the unoccupied orbitals of a ground 
state calculation. However, ® ; is an excited state with a core hole. The core hole acts 
as a local positive charge which modifies the effective potential (Vig + Vx, xc) and 
so the best Slater determinant for  ; is made of a different set of orbitals y than 
the ground state orbitals $4. Accordingly, better results are usually obtained with 
‘relaxed’ orbitals $,, corresponding to a constraint HF or DFT calculation with a 
core hole. As core holes are localized on one atomic site, the symmetry of the system 
is generally lowered in a core hole calculation and the computational cost increases. 
For crystals, in particular, a supercell calculation is needed in order to effectively 
separate the artificially repeated core hole sites. In the following, we shall write x 
regardless for relaxed and unrelaxed orbitals. 


3.4.1 Dipole Selection Rules and Density of States 


For the calculation of the dipole transition matrix elements (4; |e - r|$^), it is useful 
to expand the states |@;) in a spherical harmonics basis centred at the atomic sites R;. 
By doing so, the dipole transition selection rules known from atomic physics can be 
exploited. This simplifies the calculation and yields an interpretation of the spectra 
in terms of projected density of states as we shall see. We write 


Pr = P BhmXilm> Xim (Œ) = RuGri)Ym Gr) , (3.10) 


ilm 


where r; = r — R;, Y;,, are spherical harmonics, R;; radial functions and B;;,, com- 
plex coefficients. The core orbital is localized at some site (ie). Therefore, only 
orbitals X;ım with i = i, give a non-zero contribution to the matrix element. Next 
we write the dipole operator as a spherical tensor product e - r = $- g D eara, 
where q = 0, +1 are the spherical components of a vector a, given by ao = az, 
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a+ = (Fa, — iay)/ 4/2. The angular integrals of the matrix elements can then be 


simplified with the help of the Wigner—Eckart theorem [2] 


1 
(nm ir, nm) = (7-1 7" a : i) Qn). A) 


where ( : :) are Wigner-3j symbols and (n'l’||r||n/) are reduced matrix elements, 
which are independent of m, m', q. The Wigner-3j symbol is non-zero only for l’ = 
| X: 1 and m =m + q. These are the dipole selection rules. For example, for K- 
edge spectra / = m = 0 and thus only /' = 1, i.e. p-type final states can be reached. 
If polarized light is used we further have m’ = q, e.g. in z-polarization only p; states 
are probed. We thus see that XAS is a local probe of the unoccupied electronic states, 
where different orbital symmetries can be projected out by appropriately choosing 
the absorption edge / value and the light polarization q. 

Using the expansion (3.10) and the dipole selection rules (3.11), we find for the 


transition matrix elements from a core orbital $5, 


(PIF q\P iim) = La I1 mq UG lY ma Falom) $ (3.12) 


The absorption intensity (3.9), dropping constants, from a core shell with angular 
momentum /, located at site i, for light polarization q becomes 


2 


ll) = 95 Y Biia mag (Xidelmtg\qlPfim)| io — e Fe). (68.13) 


km 


As defined in (3.10), the orbitals R;(r) and the expansion coefficients B are, in 
principle, energy dependent. This is the choice in multiple scattering theory which 
allows a minimal basis set (one orbital for each site and /). In the following, we 
neglect the slow energy dependence of the radial waves. We then obtain 


L(o)* YO Me q Mima? Bh mpg Bim 9 —6 €). — Q4) 
k 


m,a,b=+ 


where l4, lp = | + 1 and M, " 


projected density of states matrix 


= (Xi, m-qIrg1ó;5,). We introduce the local, orbital 


Pim ^w (€) = Y (Xime (E — e) (Pil xim) = J Bi, (e — e) Bi, . 8-15) 
k 


k 


Note that the usual partial density of states (DOS) is given by the diagonal elements 
(lm = I’m’). So (3.14) can be written as 
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I, (o) © Mis q Min qPilam+q mtg (Ro + €e) . (3.16) 


m,a,b=+ 


We see that the absorption intensity is a weighted sum of a few partial DOS com- 
ponents with angular momentum l, = l + 1. For high enough symmetry, the inter- 
ference terms a Æ b vanish, leaving only the diagonal, usual partial DOS. In some 
cases, e.g. for the linear dichroism at the sulphur L» 5-edges in MoS», it was found 
that interference between p — s and p — d transitions is non-negligible [6]. For the 
special case of s-wave core states (K, L1, M; edges) where | = m = 0, the selection 
rules (3.11) give l’ = 1, m = q, such that the absorption spectrum for q-polarized 
light is directly proportional to the p;-DOS (where q = 0, +1 orq = x, y, z). 

In this section, we have seen that in the independent particle and dipole approx- 
imation, the X-ray absorption spectra are approximately given by a weighted sum 
of partial DOS with momenta / + 1, m + q. The weighting factors are local tran- 
sition matrix elements and reflect the light polarization and orbital symmetry. As 
à consequence, XAS can be used to probe the unoccupied DOS of the material in 
a site and orbital-resolved way, which gives detailed insight into the local bonding 
properties [7]. 


3.5 Absorption Spectra in Linear Response TDDFT 


3.5.1 Time-Dependent Density Functional Theory 


DFT is a ground state theory whose application to excited states is ill-founded. 
However, a large class of excitations can be computed using the time-dependent 
version of DFT. Time-dependent DFT (TDDFT) is the generalization of standard 
DFT to time-dependent external potentials Vix: (r, t). It was pioneered by Zangwill 
and Soven in 1980 [8], who developed a linear response theory for optical absorption 
spectroscopy of atoms using a time-dependent version of the LDA. In 1984, Runge 
and Gross [9] generalized the Hohenberg-Kohn theorems of DFT to the case of time- 
dependent systems, thus putting TDDFT on a rigorous theoretical ground. TDDFT 
has been applied to XAS of solids for the first time in 1998 by Schwitalla and 
Ebert [10] and to molecules in 2003 by Stener et al. [11]. 

The problem at hand is to find the time-dependent electron density n (r, t) of an 
interacting electron system subject to a time-dependent external field. In TDDFT, 
the exact time-dependent electron density n (r, t) can, in principle, be found from the 
knowledge of the external field, the universal energy functional E[n(r, t)] and the 
initial density n(r, 0). Linear response functions, including absorption coefficients, 
can be expressed as integrals over the electron density change induced by a time- 
dependent external field. Thus, if the exact functional E[n(r, t)] were known, TDDFT 
would allow to obtain exact absorption spectra. As in the case of time-independent 
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DFT, however, the exact functional is unknown. Moreover, finding good approximate 
functionals is even more difficult in TDDFT than in standard DFT. 


3.5.2 Linear Response Theory 


Here we shall outline the theory of absorption spectroscopy in linear response follow- 
ing Zangwill and Soven [8]. We consider an interacting electron system as described 
by the unperturbed Hamiltonian H in (3.4), and try to find its response to a time- 
dependent applied field ge (r, t) such as the electromagnetic field of an X-ray beam. 
The perturbation Hamiltonian is written as 


H'(t) — IS t)n(r, t)dr , (3.17) 


where n (r, t) is the electron density. It differs from the density of the unperturbed 
system n? (r) by the induced density 


én(r, t) = n(r, t) — na) . (3.18) 


The fundamental assumption of linear response theory is that the response of the 
system, ôn, is proportional to the applied field @ext, i.e. 


án(r,t) = IE rt — galt, t), (3.19) 


where x, the response function, is an intrinsic property of the unperturbed system. 
In the frequency domain, this relation reads 


önt, o) = f darxt, v esso) (3.20) 
It can be shown that x is given by the retarded density-density Green's function 


xm, r,t—7)- —i0(t — t) OAC, t), AQ", £^)]]0) , (3.21) 
where fi(t) = e/P' fie !"' is the density operator in Heisenberg representation, |0) is 
the exact ground state of H with energy Eo, [, ] denotes the commutator and 0 (x) is 
the Heaviside step function [0 (x) = 1 for x > 0 and 6(x) = 0 for x < 0]. By insert- 
ing a complete set of excited states »^,, |m)(m| and performing a time-frequency 
Fourier transformation, we obtain the following exact (‘Lehmann’) representation: 


(01 Qr) m) (m|A@’)|0) (01 Œ) |m) (mf (7)]0) 


, (3.22 
ho — En + Eo + in ES ho + Em — Eo ig ( 


X (r, r, w) — 
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where |m) are excited states with energy Em and ņ is an infinitesimal positive number. 
The exact eigenstates and energies of the interacting electron systems are unknown, 
so (3.22) cannot be evaluated directly. For a non-interacting electron gas, however, 
all eigenstates are Slater determinants, and (3.22) can be calculated. The only exci- 
tations which give non-zero matrix elements are single particle-hole excitations 
[m) — C$ Ch [0) with energy €, — €n, where p and h label states above and below the 
Fermi level, respectively. This gives the response function in the independent particle 
approximation 


Xo(r, r', @) = 5 PTP HO, Ah’) 


h]. 3.23 
hio — €p + €n + in [p < h] ( ) 


hp 


If the electrons did not interact we would have x — xo. But they do interact. In 
TDDFT, the interaction is handled as in DFT, by introducing an auxiliary, non- 
interacting system with the same electron density n(r, t) which corresponds to a 
time-dependent effective potential. In the real system, the density change dn(r, t) 
is induced by the external perturbation qe (r, t). In the auxiliary system, however, 
the density n(r, t) corresponds to the sum of the Kohn-Sham potential Vgs and the 
perturbation qe. As the Kohn-Sham potential depends on the density, a density 
change ôn gives rise to an induced field pina (r, t) = 5Vxs[n(r, t)]. Thus, the density 
change ôn (r, t) is due not only to the true external potential ext but also to the induced 
field Ging. Note that there is a feedback effect: go, > 6n > Gina > én > ÓQind ..., 
so we need to solve for ôn and giga self-consistently. Further, in linear response theory, 
a linear relation between the induced charge density and the induced field is assumed 


Yina (rt) = | dr'dt' K (rt, r't')ón(r't^) (3.24) 


which defines the interaction kernel K. We have Yina = ô Vgs = ô Vy + ôVyc, where 


,ón(r't) 
lr — r'| 


ó Vxc (rt) 
snr’) 


8Vy(rt) = Ji dr , 6Vxc(rt) = 1 dr’ dt’ áón(rt). (325) 


The total time-dependent perturbation in the auxiliary system is often called the *local 
field’, Ploc = Pext + Pina. As the electrons of the auxiliary system are independent, 
they respond to the perturbation Pioc with the free response function xo, i.e. ón(rt) = 
f dr'di' xo(r, r’, t — t")ódi (rt^). By construction, the charge densities of the real and 
auxiliary systems are the same, so we have 


X Pext = én = X0 Ploc = X0(Pext E Kón) = Xo(1 + K X )Pext , (3.26) 


where arguments and integration symbols have been suppressed to simplify the nota- 
tion. Since ga is arbitrary, we have 


X=xXotxoKx 9 x=% -K)'. (3.27) 
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So the full response function x can be calculated from free response function xo 
and the kernel K, by iteration or inversion. Equivalently one can calculate the local 
potential directly by iteration of Ploc = Pext + K X0Pioc [8]. The problem is that the 
exchange-correlation part of the kernel 


Kxc (rt, rt) = a“ (3.28) 
is not known exactly. The adiabatic approximation consists in using a static exchange— 
correlation potential, which may be taken from standard time-independent DFT. 
In this case, Kyc = [6Vyc(r)/dn(r’)]6(t — t") such that g,q(t) changes instanta- 
neously with ôn (t). As a result, K (r, r', w) is frequency independent and dynamical 
screening is neglected. X-ray fields correspond to fast oscillations, so neglecting 
dynamical effects is questionable. 


3.5.3 Absorption Spectra 


The optical absorption coefficient is essentially the imaginary part of the response 
function x as we shall show now. We consider an electromagnetic wave given 
by E(r, t) = eEoe/4*^^', where e is the light polarization vector, not to be con- 
fused with the electric charge e. The induced electrical polarization is P(r, t) = 
—eón(r, t)r and so the change in energy density is -E P = eE - rôn. In the dipole 
approximation, E(r, t) ~ eEoe |^" and the perturbation in (3.17) is given by 


Qexi(r, w) = eEoe-r. (3.29) 


The total induced dipole moment is 
Io) = -e f rònt, o)dr = -es [ rx (r, r', o)e- r'drdr/ , (3.30) 


and the absorbed energy is Re{E-dy/dt} or equivalently Im(oE - u(c)). The 
absorption coefficient o (c) is the absorbed energy divided by Eĝ, which yields 


o (o) = -4raho | dráre -r Imx (r,r, œ)e- r. (3.31) 


This expression of the absorption coefficient is fully equivalent to (3.3). In the inde- 
pendent particle approximation, we put x — xo and obtain from (3.23) 


aolo) = 4r ahw D> leple - rlon) Sho + €i — €p) (3.32) 
hp 


in agreement with (3.9). 
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In summary, TDDFT with linear response provides a rigorous and efficient 
framework for calculating absorption spectra. Compared to the independent particle 
approximation, TDDFT takes the screening of the electromagnetic field into account 
by introducing the induced field @ing(t) which is calculated self-consistently with 
the density change ôn (t). In practice, the problem is to find good approximations for 
the unknown exchange-correlation kernel K xc. The Hartree part alone, i.e. putting 
K xc = 0, yields the well-known random-phase approximation (RPA) [10]. Apart 
from the single particle-hole excitations included in xo, the RPA can describe plas- 
mon excitations, i.e. collective oscillations of electron gas, which can be observed, 
for example, as satellite peaks in core-level photoemission spectra. The RPA kernel 
also gives rise to a redistribution of spectral weight between different transitions. 
This may strongly change the peak intensity ratio, e.g. between the Lz and L5 white 
lines in transition elements [10]. In adiabatic TDDFT, K yc can be obtained from 
standard DFT [8], but such static approximations to K xc do not improve much over 
the RPA [10, 12]. It appears that complex configuration mixing such as multiplet 
excitations cannot be described by the common, adiabatic kernels. Going beyond 
the adiabatic approximation is difficult, but some non-adiabatic kernels have been 
proposed and applied to the X-ray absorption problem [13]. 


3.6 Photoemission Spectroscopy 


PES is probably the most direct way of probing the electronic structure of materials. 
In a PES, light is shone on a surface and the kinetic energy, and possibly exit angle 
and spin, of the emitted electrons is measured. In core-level PES, electrons from 
the inner atomic shells are excited. As these levels are element specific, core-level 
photoemission is a powerful tool for chemical analysis. 

Angle-resolved core-level photoemission from crystal surfaces is known as X- 
ray photoelectron diffraction [14]. The photoelectron wave spreads from the core 
hole site and is diffracted by the neighbouring atoms. Analysis of the diffraction 
pattern gives precise information about the local structure around the atoms of a 
given chemical species. X-ray photoelectron diffraction can be well modelled with 
real-space single or multiple scattering theory on a finite cluster of atoms. 


3.6.1 Angle-Resolved Photoemission Spectroscopy 


Angle-resolved photoemission spectroscopy (ARPES) is the major method for mea- 
suring energy band dispersion (the “band structure") of crystals. An intuitive picture 
of ARPES is provided by the three-step model [15]. The three steps are as follows: 


]. Photon absorption in the bulk of the material resulting in an inter-band transition 
|mk) — |nk), with e(n, k) = e(m, K) + ho. Here n, m are band indices and the 
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three-dimensional crystal momentum vector k is conserved up to a reciprocal 
lattice vector G. 

2. Propagation of the excited wave to the surface, with damping due to inelastic 
scattering. 

3. Transmission through the surface by matching the Bloch wave |nk) to a plane 
wave exp(ik’ - r). The matching conditions are dictated by conservation of energy 
and the surface parallel component of k, i.e. kj, = kj and k? = ki +2mVo/h’, 
where Vo represents the surface potential barrier. 


The three-step model is very useful for relating the photoemission data to the three- 
dimensional band structure of the material. However, for an accurate calculation 
of ARPES intensities, the one-step model should be used, where the photoelectron 
final state is calculated in all space (bulk, surface and vacuum) as a single wave func- 
tion with proper boundary conditions (so-called ‘time-reversed low-energy electron 
diffraction’ boundary conditions). A suitable computational scheme is the layered 
Korringa-Kohn-Rostoker method [16]. 

The hole left behind in the photoemission process is not an independent particle, 
but it interacts with the electrons and the lattice, giving rise to various many-body 
effects, which are conveniently described using quasiparticle theory. 


3.7 Quasiparticle Theory 


In a photoemission experiment, an electron is ejected from the system, which 
becomes ionized. Neglecting the interaction between the photoelectron and the hole 
left behind, i.e. applying the so-called sudden approximation, the photoemission 
excitation is a one-electron removal process from the N-particle ground state to a 
N — | particle excited state. In the same fashion, inverse photoemission probes the 
one-electron addition process from the N-particle ground state to a N + 1-particle 
excited state. The true excitations are called quasiparticles. In the limit of vanish- 
ing electron interaction, the quasiparticle wave functions are the spin-orbitals of the 
ground state Slater determinant and the quasiparticle energies are the one-electron 
levels. In the independent particle approximation, the quasiparticles are taken as HF 
or Kohn-Sham orbitals. This neglects electron correlation and the interaction of the 
electrons with the lattice vibrations. These effects change the quasiparticle energies 
and wave functions. The quasiparticles are said to be renormalized or ‘dressed’ by the 
interaction. In particular, due to inelastic scattering at collective excitations such as 
phonons and plasmons, the one-electron quasiparticles will decay after a character- 
istic lifetime. As a result, compared to the delta-function-like photoemission peaks 
corresponding to the independent particle approximation, the true photoemission 
peaks are energy shifted and lifetime broadened. Moreover, some spectral weight 
of the main peak is lost to extra (‘satellite’) peaks, corresponding to some inelastic 
process. 
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3.7.1 Green's Functions 


Quasiparticles can be described using many-body Green's function techniques. We 
introduce the retarded one-electron Green's function 


G(r, i, t — t) = —i6(t — t(0|( (rt), Wt (rt^))]0), (3.33) 


where W* (rt) is a Heisenberg field operator which creates an electron at point r and 
time f, and V destroys one. |0} is the many-particle ground state and (A, B) = AB + 
B A denotes the anti-commutator. Note that we have suppressed spin for convenience. 
This Green's function, or ‘propagator’, gives the probability amplitude for an electron 
to be found at rt if one was added at r’t’. The one-electron removal and addition 
spectrum is given by the spectral function 


A(k, @) = -LImG(k, w), (3.34) 


where G(k, œw) is the space and time Fourier transform of (3.33). In the following, 
we focus on a perfect crystal and suppress the band index. The Hamiltonian of the 
non-interacting system is then given by Ho = » ', exfix, where k labels the Bloch 
eigenstates with energy ey and ñg is the corresponding occupation number operator. 
It is easy to see that in this non-interacting case, Green's and spectral functions are 
given by 

Go(k, w) = (o — ek ip) |, Alk, w) = (w — ék) . (3.35) 


Thus, the photoemission peaks are delta functions, meaning that Bloch states are exact 
excitations of energy ey (band energy) and infinite lifetime. As mentioned above, due 
to electron interaction, the true photoemission peaks are shifted, broadened and may 
have satellite structures. In quasiparticle theory, these effects are described by the 
so-called self-energy X, which is essentially the difference between the inverses of 
the exact and the free Green's function. The self-energy is defined through the Dyson 
equation 

G=Got+tG@tG e G!=G)'-x. (3.36) 


For a single band in a crystal, we have 
G (k, w) = o — e — Erlo) . (3.37) 


It is clear from (3.35) and (3.37) that ReX describes a shift of the eigenvalues ey 
(band energy) and Im results in peak broadening, i.e. it reflects the finite lifetime 
t = h/Im% of the quasiparticle. There are various methods to find (approximate) 
self-energies. For the electron correlation effect, two of the most popular methods 
are the so-called GW approximation and dynamical mean-field theory. 
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3.7.2 GW Approximation 


The GW approximation was invented by Hedin in 1965 [17] and owes its name from 
the form of this self-energy, which is X; = iG W, i.e. the product (or convolution) of 
Green’s function (G) and the screened Coulomb interaction (W). The latter is given 


by [18] 
2 -1 / 
Wir, r, o) = £ feo (3.38) 
4T E9 Ir — r'| 


where e(r, r’, w)~! is the inverse dielectric function. This expression may be under- 
stood by analogy to the electrostatic energy between two electrons in a polarizable 
medium, which is given by e /[47: eoe, |r — r'|], where e, is the relative dielectric per- 
mittivity. The dielectric function e(r, r', w) generalizes e, to inhomogeneous media 
and dynamic screening effects. The GW approximation is most often used in a non- 
self-consistent way, i.e. as & = i Go Wọ with the free Green's function Go instead 
of the full Green's function G. The GW approximation has been very successful 
for correcting band energies of weakly correlated systems. In particular, bandgaps 
of semi-conductors are very well reproduced in the GW approximation, while the 
values obtained in DFT (except for DFT-HF hybrid functionals) are systematically 
too small [19]. 


3.73  Bethe-Salpeter Equation 


Atthis point, we briefly switch back to the problem of absorption spectroscopy. Since 
light absorption creates an electron-hole pair, absorption spectra are described with 
an electron-hole (i.e. a two-particle) Green's function Gen. If the excited electron 
and the hole do not interact, Gep is just the product of the one-particle removal (hole) 
Green's function G; and the addition (electron) Green's function Ge. Electron—hole 
interaction leads to coupling of these two Green's functions, which may be expressed 
in a Dyson-type equation as [20] 


Ga, (1,2; 1,2) = G,(,1)G4Q, 2) (3.39) 
+ J Ge(1, 3)G4 (2, 4) K (3, 4; 5, 6)Gen (5, 6; 1', 2')d3d4d5d6 , 


where 1 stands for all coordinates of particle 1 and K is the interaction kernel. 
In the Bethe-Salpeter equation (BSE) approach, (3.39) is solved with K given by 
the screened Coulomb interaction in (3.38) and the bare exchange interaction. The 
electron and hole Green's functions, Ge and Gh, are commonly computed in the 
GW approximation. The BSE approach is arguably the most accurate first-principles 
method for absorption spectroscopy in solids, but it is computationally very demand- 
ing. It was first applied to X-ray spectra by Shirley in 1998 [21]. It accounts well 
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for strong excitonic effects and features electron-hole multiplet coupling in L-edge 
spectra [22]. Let us note that the latter effect is also well described with multichannel 
multiple scattering theory [23, 24], where the electron-hole coupling is dealt by a CI 
calculation of the scattering matrix. However, at present, none of these particle-hole 
theories can fully account for the complex multiplet structure of L-edge spectra of 
open-shell transition metal compounds. These spectra are still best described with 
CI methods, either the semi-empirical ligand-field multiplet model (see Chap. 4) or 
the ab initio complete active space approach on small clusters [25]. 


3.7.4 Static and Dynamical Mean-Field Theory 


In strongly correlated electron systems, e.g. 3d transition metal oxides and 5 f ele- 
ments, collective phenomena such as band magnetism, metal-insulator transition 
and high-T, superconductivity are observed. These are genuine many-body effects 
that cannot be explained in the independent particle picture. Itinerant magnetism 
and the metal-insulator transition are due to the competition between the kinetic 
energy, which leads to delocalized band states, and strong local Coulomb repulsion 
which favours electron localization and formation of magnetic moments. The most 
simple model to study these problems is the (one-band) Hubbard model [26], whose 
Hamiltonian is given by 


= = F 
H = ene + US nani = Cm + U Y nana, 
ko i i 


ijo 


where lattice sites are labelled by i and j, the crystal momentum by k and spin 
by c. Further, ct (c,) creates (destroys) an electron in state v, and n, = cfc, counts 
them. t;; is the hopping (or ‘transfer’) integral between sites i and j, and U is the 
Coulomb energy between two electrons occupying the same site. The corresponding 
one-electron Green's function is 


Gic (0) = [o — €x — Ero (9)] | . 


Despite the apparent simplicity of the Hubbard model, the exact solution is unknown 
(except in one dimension) and the self-energy X must be approximated. At the lowest 
level, there is the normal mean-field (i.e. HF) approximation, where X is taken to 
be static, i.e. independent of frequency o. It is given by 344 = U (n, 4), where the 
occupation numbers (n, 4) must be calculated self-consistently. When the lowest 
energy solution corresponds to a different occupation between spin-up and spin-down 
bands (n+) A (nky), the band energies Ezo = e; + U (nx_,) become exchange split, 
and the ground state is ferromagnetic. HF and LDA are such static mean-field theories 
and can account for certain static exchange effects, such as ferromagnetism. But they 
lack all dynamic correlation, which is crucial for the metal-insulator transition and 
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for various phenomena seen in photoemission spectra, such as band narrowing and 
satellite structures. 

Dynamic correlation effects can, to some extend, be described by the dynamical 
mean-field theory (DMFT) [27], where the self-energy is taken to be frequency 
dependent but local, i.e. momentum independent, X, (vw) — X5 (v). We note that 
self-energies from static mean-field theory (such as LDA) and dynamical mean-field 
theory can be, and often are, combined. Although the total self-energy is then both 
momentum and frequency dependent, it is still an approximation. 

The basic idea of DMFT is to map the Hubbard model with correlation (U 4 0) 
on all lattice sites onto the Anderson model, which describes one correlated atom (the 
*impurity") coupled to an effective bath of band states. In the Anderson model, we 
have U +Æ 0 only at the impurity site (i = 0), and as a consequence, the self-energy 
is a frequency dependent, but local, quantity 2;; (c) = 29(«)6;90;;. In DMFT, this 
local self-energy is taken as the self-energy of the lattice problem (Hubbard model). 
The mapping, i.e. the definition of the effective bath, must be done in a self-consistent 
manner such that the on-site matrix elements of the lattice model Green’s function 
Gj; (v) coincide with those of the impurity model [27]. While the Anderson impurity 
model is simpler than the Hubbard model, it is nonetheless a complex many-body 
problem. Implementations of DMFT mostly differ in the approximations used for 
solving the impurity problem. 

DMFT has been applied to photoemission spectroscopy of correlated systems [28] 
and results in improved spectra compared to independent particle approximation (HF 
or LDA). In transition metal systems, for example, photoemission spectra calculated 
in DMFT can account for finite temperature effects, correlation-driven band narrow- 
ing and satellite peaks [29]. 


3.8 Conclusions 


In this chapter, I have tried to give a brief introduction to the theory of X-ray absorp- 
tion and photoemission spectroscopy. Along the way, it appeared useful to present 
succinctly the principles of several computational methods of electronic structure 
that are used in spectroscopic calculations. Given the vast nature of the subject, this 
account is necessarily very incomplete. But I hope that the reader got an idea of 
the physics underlying the different theoretical methods and that it aroused his/her 
curiosity to dwell deeper into the subject by reading some of the cited literature. 
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Chapter 4 A) 
X-ray Dichroisms in Spherical Tensor m 
and Green's Function Formalism 


Hebatalla Elnaggar, Pieter Glatzel, Marius Retegan, Christian Brouder, 
and Amélie Juhin 


Abstract In this book chapter, our goal is to provide experimentalists and theo- 
reticians with an accessible approach to the measurement or calculation of X-ray 
dichroisms in X-ray absorption spectroscopy (XAS). We start by presenting the key 
ideas of different calculation methods such as density functional theory (DFT) and 
ligand-field multiplet (LFM) theory and discuss the pros and cons for each approach. 
The second part of the chapter is dedicated to the expansion of the XAS cross section 
using spherical tensors for electric dipole and quadrupole transitions. This expansion 
enables to identify a set of linearly independent spectra that represent the smallest 
number of measurements (or calculations) to be performed on a sample, in order 
to extract all spectroscopic information. Examples of the different dichroic effects 
which can be expected depending on the type of transitions and on the symmetry of 
the system are then given. 


4.1 Introduction 


4.1.1 The X-ray Absorption Cross Section 


The X-ray absorption cross section is obtained by dividing the transition rate by the 
flux of photons and summing over all possible final states. It is given in (4.1) where 
hw is the photon energy, and o the fine structure constant. / (Ej) and F (Ep) are the 
initial and final state wave functions (energies), and T is the transition operator, 
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o, = 4? oho Y U|T* | F)(F|TU[D)SCE; + ho — Ep) . (4.1) 
F 


The transition operator describes the interaction of photons with the system. In the 
case of an electromagnetic plane wave, the transition operator writes 
T x eik" [he m ik x e] where € is the polarization vector of the incident 
photon, and k is the incident wave vector, g the gyromagnetic ratio (g ~ 2 for the 
electron), and s the electron spin [1]. The exponential in the transition operator can 
be expanded as a Taylor series 


(k. r}? 


ikr n TE 
e &l-cik-:r 3 


(4.2) 


The first term in the expansion approximates the interaction of the light with 
the atom as an electric dipole ((F|T|I) = h(Fle + V|I) = —"—EP(Fle . r|])). 
The second term gives rise to the electric quadrupole interaction 
(-i mEr—ED (F le-rk-r|Z)) and to the (negligible) magnetic dipole one 
(-$(Fl(e x k)-(L+ gs)|7)) [1], the third term is the octupole transition 
("Ern ED CF|Ce - r)(k - 7?|2)) [2], and so on. In this chapter, we focus on electric 
dipole and quadrupole transitions. 

The summation over final states in (4.1) implies that one has first to calculate 
the ground state, all possible final states, and then compute the transition matrix 
elements between the ground state and the final states. This is not always the most 
efficient way to numerically calculate XAS. Instead, Green's function can be used 
to replace the summation over final states by à propagator of the transition opera- 
tor. aoe Yor |F) (F| (Er + ho — Ep) > ET L (G+ — G-) with G*(E; + hw) = 
where Hp is the final state Hamiltonian. The “Fermi Golden Rule" 


ho— Bane ir? 
can be expressed as in (4.3). Most modern codes calculating core level spectra use 
this expression 


o,, = —4raholm [U|T* G* (E; + ho)T 1)] . (4.3) 


Let us now discuss electric dipole transitions according to the first term of the 
expansion in (4.2). For electric dipole transitions we have T = € - r. One can see 
from the expression of the transition operator that the cross section will depend on 
the orientation of the polarization vector (e) with respect to the absorbing system (r). 
The X-ray absorption spectrum measured on any sample is in fact the sum of several 
linearly independent spectra as will be discussed further in this chapter. They can be 
disentangled by macroscopically orienting the sample, e.g., by using a single crystal 
or orienting the magnetic moments. Consequently, one may wonder: 


— How many independent spectra exist for a given system? 
— What information do they give us about the absorbing system? 
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4.1.2 Definition of Dichroisms 


X-ray dichroism can be defined as the difference in the X-ray absorption cross section 
measured for two orthogonal polarization states of the incident light. There exist dif- 
ferent types of dichroism. Dichroism measurements can be classified according to 
the type of polarization used for the measurements into linear and circular. Linear 
dichroism (LD) is the difference measured with linearly polarized light, where in 
most cases the polarization vector is set parallel and perpendicular to an orienta- 
tion axis, while circular dichroism (CD) is the difference measured with circularly 
polarized light (left handed and right handed). 

Not all systems exhibit dichroism effects when the polarization of the light is 
changed. Certain symmetry conditions regarding the interaction operator between 
light and matter have to be satisfied for dichroism effects to occur, which brings us to 
the second classification of dichroism types. Two symmetry operations are essential 
for this classification: 


— Time-reversal symmetry, 
— Space inversion (also called parity). 


Natural dichroism (ND) refers to dichroism effects that occur in non-magnetic 
systems where time-reversal symmetry is conserved (i.e., the system is even under 
time-reversal operation). Using linearly polarized light, one can measure X-ray nat- 
ural linear dichroism (XNLD). On the other hand, using circularly polarized light, 
one can measure X-ray natural circular dichroism (XNCD) only for systems that 
do not have a centre of inversion (i.e., the system is of odd parity). 

Magnetic dichroism (MD) relates to dichroism effects measured in magnetic 
(ferro, ferri, or antiferromagnetic) systems where time-reversal symmetry is broken 
either by spontaneous magnetic ordering in the sample or by the application of an 
external magnetic field. X-ray magnetic linear dichroism (XMLD) is parity-even 
and time-reversal even and non-reciprocal linear dichroism (NRLD) is parity-odd 
and time-reversal odd. Using circularly polarized light, X-ray magnetic circular 
dichroism (XMCD) and X-ray magneto-optical dichroism (XM x D) effects can be 
measured. The former is parity-even and time-reversal odd while the later is parity- 
odd and time-reversal odd. 


In magnetic materials, which will be discussed in this chapter, several cases are 
possible: 


— In the case of centrosymmetric crystals with ferro- or ferrimagnetic properties, 
one can measure XMCD. 

— In the case of centrosymmetric crystals with antiferromagnetic properties, XMLD 
can be measured. 

— In the case of magnetized, non-centrosymmetric crystals, XM x D and NRLD can 
be measured. 


XMCD and XMLD measurements give, respectively, access to the average value 
of (M) and (M?) of the local magnetization for the absorber. On the other hand, 
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XM xD and NRLD signals are related to moments that are more complex, the anapole 
orbital moment and other higher order moments. 


4.1.3 The Many-Body Problem in Spectra Calculations 


The calculation of an absorption spectrum is a formidable task: it requires the cal- 
culation of the ground state of the system, the excited states of the system, and the 
interaction of the system with the electromagnetic field (X-ray beam). This means 
that the theoretical approach required to calculate XAS has to be suitable for cal- 
culating the electronic structure in addition to properly considering the interaction 
with the electromagnetic field. Approximations have often to be made to calculate 
the absorption (or the scattering Kramers-Heisenberg) cross section and the spec- 
troscopist therefore has to choose which theoretical approach is the most suitable for 
the problem at hand. 

In principle, the ground and excited states can be determined by solving the 
Dirac equation which accounts for all relativistic effects and includes all possible 
interactions in the Hamiltonian. This full treatment provides a relativistic, many- 
body, extended description of the electronic states. Unfortunately, in practice, it is 
not possible to perform such a calculation as it is computationally very consuming. 
In most cases one solves instead the Schródinger equation and introduces relativistic 
effects as perturbations (e.g., the spin-orbit interaction). Furthermore, one can make 
use of the Born-Oppenheimer approximation to separate the electronic properties 
of the system from the dynamics of the nuclei. In order to describe the electronic 
part of the wave function, various theoretical approaches can be used such as (1) the 
single-particle extended picture (DFT-based approaches), (ii) the many-body atomic 
picture (multiplet theory), and (iii) the many-body extended picture (beyond DFT 
methods). 


4.1.3.1 The Single-Particle Extended Picture of Electronic States 


DFT-based methods can be used to describe the electronic states using a single- 
particle extended picture. Although DFT methods should formally only apply to 
ground state calculations, they are often used for the calculation of excited states 
probed in core level spectroscopies. DFT methods simplify the ground state wave 
function of N electrons by replacing them with a fictitious, non-interacting system of 
independent electrons, that have the same electronic density as the real system. The 
correct charge density minimizes the total energy of the system. The Schródinger 
equation is transformed into a system of equations (called the *Kohn-Sham equa- 
tions") with an effective Hamiltonian and wave functions, which are functions 
of only one space variable. This implies that DFT is essentially a single-particle 
approach, although some many-particle (many-body) interactions are contained in 
the exchange and correlation term of the electronic effective potential. The exact 
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analytical expression of this term is not known. This means that DFT can only be 
applied in an approximate form for example using the local density approximation 
(LDA) or the generalized gradient approximation (GGA)]. 

One can determine the ground state wave function by solving the Kohn-Sham 
equations which are constructed from the single-particle wave functions. The ground 
state wave function is therefore formed by a single Slater determinant. This is an 
important point because it limits the ability to treat a many-body response of the 
system described through a linear combination of Slater determinants. To illustrate 
this let us take an example with two electrons. Coupling s — 1/2 to s — 1/2 yields 
S = 1 or 0. This corresponds to four |S, Ms) wave functions: |S = 1, Ms = 1), 
|S = 1, Ms = —1) |S=1, Ms = 0), and |S = 0, Ms = 0). These functions need 
to meet the property of being anti-symmetric under particle exchange and it can 
be shown that only |S = 1, Ms = 1), |S = 1, Ms = —1) can each be expressed as 
a single Slater determinant. However, the |S = 1, Ms = 0) and |S = 0, Ms = 0) 
states can only be expressed as the combination of two Slater determinants and thus 
cannot be calculated in DFT. 

One can group the various DFT-based methods according to their characteristics: 


— Cluster or periodic: The Kohn—Sham equations can be solved either for a cluster 
centred around the absorbing atom (direct or real space methods) or starting from 
a unit cell of the crystal (or a multiple unit cell, which is called supercell) in order 
to take advantage of the 3D periodicity (reciprocal space methods). 

— Self-consistency or not: The Kohn-Sham equations can be solved without or 
(preferably) with self-consistency, i.e., using an iterative cycle where two suc- 
cessive steps are mixed until a convergence criterion is reached to determine the 
charge density. 

— Type of basis functions used to expand the orbital solutions of the Kohn- 
Sham equations: either localized functions [linear combination of atomic orbitals 
(LCAO), linear muffin-tin orbitals (LMTO)], or delocalized functions [plane waves 
(PW), full-potential linearized augmented plane waves (FLAPW)]. 

— Approximation made on the shape of the electronic potential: For example in 
LMTO or multiple scattering theory, the potential is approximated to be spherically 
symmetric in the atoms, and constant between them (muffin-tin). In full-potential 
methods [FLAPW, or projector augmented wave (PAW)-pseudopotentials], no 
approximation is made, which is generally preferable, even though it makes the 
calculations more consuming. 


4.1.3.2 The Many-Body Atomic Picture of Electronic States 


A Simple Introduction to the Many-Body Atomic Picture 


Let us consider as an example a Crt ion in an octahedral (O;,) environment. Here 
the solid is reduced to an atom embedded in a mean field known as the crystal field 
(CF) that mimics the effect of the inter-atomic interactions. The atomic electronic 
configuration is 1s225?2 p635?3 p63d?. The degeneracy of the Cr 3d levels is lifted 
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due to the CF and the 3d orbitals are split into two groups: the e, orbitals pointing 
towards the ligands and the t2, orbitals pointing between the ligands. The number of 
possible electronic states is given by the number of allowed arrangements of three 
electrons into ten spin orbitals, i.e., C ih = 120 microstates. The energy separation 
between these states arises due to the combined effect of: (1) CF splitting, (ii) elec- 
tronic repulsions, and (iii) spin-orbit coupling, i.e., the multiplet effects. Electrons 
occupying closed shells do not actually contribute to the energy splitting of the elec- 
tronic levels; there is only one way to completely fill a shell giving a single average 
energy of the configuration. 

All these multiplet states can be further grouped in so-called term symbols (or 
spectroscopic terms) according to their energy, spin and orbital moments. The relative 
energy positions of these spectroscopic terms for a 3d" transition metal ion in Op CF 
(and neglecting 3d spin-orbit coupling) were calculated by Tanabe and Sugano and 
are available in several references (e.g., [3, 4]). Similar diagrams are available in [5] 
for symmetries lower than O}, such as trigonal or tetragonal. The relative energies 
of the electronic states depend on the CF parameters as well as the Racah parameters 
that relate to the electronic repulsions. The determination of the spectroscopic terms 
becomes very complex when the spin-orbit coupling and Zeeman terms are included 
in the Hamiltonian and/or if lower symmetries are considered. LFM theory takes 
these effects into account and has been realized in several computer codes. 


Key Ideas of Ligand-Field Multiplet Theory 


Atomic multiplet theory, crystal field multiplet theory, and LFM theory (sometimes 
collectively referred to as the multiplet theory) are based on concepts that were 
developed in atomic physics and make use of group theory. One has to solve the 
Schródinger equation for the ion with its N electrons in a given configuration 


É|g) = Elg), (4.4) 


where Z is the Hamiltonian of the system for the chosen configuration, E and |g) are 
the eigenvalue and eigenstate, respectively. The different eigenstates are functions 
of N electrons, hence they are called many-body (or multi-electronic) states. The 
Hamiltonian is expressed as 


H=T+V+ ce + Hso + Her , (4.5) 


where f is the kinetic energy of the electrons, V the Coulomb attraction between elec- 
trons and the nucleus, Vee the electron—electron Coulomb interaction, H so the spin- 
orbit coupling interaction, and Hc r the CF Hamiltonian, which takes into account 
the local environment of the absorbing atom. 

These interactions will now be expressed in second quantization formalism. In 
this notation, any operator can be expressed in terms of creation (c!) and annihilation 
(c+) operators. The operator c! creates a state characterized by the quantum numbers 
t (for example, if we choose to express the states as spin-orbitals, r will be the set 
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of quantum numbers n, l, m, o that give the principal quantum number, the orbital 
momentum, the projected orbital momentum, and the projected spin momentum that 
uniquely identify this state) when it acts on the vacuum state |0). The operator c, 
is the annihilation operator of the state c. In this formalism, the (spherical) atomic 
interactions write 


22. 
^ p 4 
Î = X (ul—In)ch ey, (4.6) 
p 2m 
^ Ze " 
Va 5 al- lne en, (4.7) 
Trst 
Fe duce e (4.8) 
ee = x Ti T2 x x |T3 T4 CL Cr Cr Ct, 5 $ 
2 Tj, 02,03, T4 IF = r'| ' i 
Vso = X (mills It)c} cn . (4.9) 


T1, T2 


Here p is the linear momentum operator, m is the electron mass, e is the electron 
charge, ? is the position operator, Í and $ are the orbital and spin momenta operators, 
and é is an atom dependent constant that is a function of the gradient of the atomic 
potential (£ œ 1 d). The kinetic energy of the electrons and the Coulomb interaction 
of the electrons with the nucleus are fixed for a given atomic configuration and they 
contribute only to the average energy of the configuration; hence T and V do not 
contribute to the multiplet splitting, and will not be further discussed. As a matter 
of fact, they are typically not evaluated in standard multiplet calculation programs. 
However, we are left with the task of simplifying the terms Yso and V... Let us start 


with Vio and assume that z is the quantization axis. 
Influence of Spin-Orbit Coupling Interaction 


The spin-orbit interaction is given as follows: 


Vso = lids. + Ly Sy + l;5.)|v)c], Cr 


T],T2 


= X (tlElzsz|t2)ch es (4.10) 


T1,T2 


+Y (nlEss, + lysy)lt2)c} es . 


Ti, T2 


We shall use a set of atomic  spin-orbitals as basis functions, 
Wi = Rn (r)Yn n; (0, $)o; where Y; m; (0, Q) is the spherical harmonic, and Rp, 7, (r) 
is the radial part. Given that the potential in € has a spherical form one can separate 
the radial and angular parts of the Hamiltonian. The angular part of the first term 
gives 
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> (Yn m UE MMC T m, o, Ch.m,02 
(mi). (l2,m2) 
1 l 
= T ii 
=5 S neut — Gein + (4.11) 


m=—l 


The second term gives 


n 
2. (Yi sm 1 IL, + lysy) | Yi, m,02)6, m, ,0; Ĉb,m,02 
(1,m1).(5,m2) 


MEC " 
= IN (i m0115 (57 1757)] Yn ,02)6, pis Ch, (4-12) 
(li, mı), (lo,m2) 
l—1 


— > vil — m)(l t m c Dj. m4, , Clm,t F c} m, qel m+1, » : 


2 


Hence the angular part of the spin-orbit Hamiltonian finally writes 


l 
^ 1 1 - 
Hso — 2 > mj, ACLm,4 = eee) 
m=—l 
1 I-1 . . 
+5 vil —m)l+m+ D mp1, elms se 6] x Flops . (4.13) 


m--l 


It is clear from (4.13) that the spin-orbit interaction mixes states with different 
projected orbital and spin momenta. 


The Electron-Electron Coulomb Interaction 


Now we undertake the simplification of the electron-electron Coulomb interaction. 
This is more involved than the simplification of the spin-orbit coupling Hamiltonian. 
Cowan nicely explains the details of the derivation in his book [6]. We will rely on a 
combination of the derivations by Cowan [6] and Haverkort [7] in this section. This 
is not a thorough derivation; it is only meant to qualitatively explain the origin of 
multiplet splittings. 

The ig step to simplify this Hamiltonian is to o perform a multipole expansion of 


the term zm EY X ox Y B s $3 x E Y, 4, (0, Ø), where r£ and r£ are, 


respectively, the lesser and greater of the distances r and r’. Now we are in a position 
to separate the radial and angular terms of the expression and separate the angular 
variables of each electron. Using the atomic spin-orbital basis to express the matrix 
elements of the angular part of the Hamiltonian one finds that 
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k 


XO nm, ))01Yi m0, à )01 YS) Vein (OO Vem. 0) 


T1, T2, T3, T4 k=0 m=—k 


Yn; 9, O3Y, m, 6’, No. 
l,m (0, $)03Yi m, (0 , Q )oa) (4.14) 


oo k 
=). M OL (nm 0, p)0ilYrm (8. 9)0Yi n. (0. 0)03) 


k=0 m=—k 1,12,73,T4 


(Yim (0^, pozl Yk m (0', $^)|Yi m, (0^, $^)o4) . 


We have in (4.14) integrals involving three spherical harmonics which are given 
by the Gaunt coefficients. This can be used to restrict the values of the summa- 
tion over k and m. The Gaunt coefficients are different from zero in the first inte- 
gral only for m = m, — m3. Similarly, the second integral is different from zero for 
m = m4 — m». Hence, in combination, one concludes that the total M, is conserved 
(mı + m» = m3 + m4) for the integrals. Furthermore, the values of k are restricted 
to values of k < min(|l; + lil, |l; + l|). This simplifies the angular part. 

Let us now investigate the radial part. The general expression for a scattering 
event involving four different shells is expressed in (4.15) 


min(|Ii +4] ,J2+/3|) rk 
Yo Gua R40. lR Rn). (4.15) 
Tj, T2, T3, T4 k=0 f> 
In the case of Coulomb interaction in a single shell, n; = n» = n3 = n4 and 
L = h = l; = l4 and hence one obtains the expression 


2l 


k 
FO = DO RORO ETIR 0 Rar) ; (4.16) 
k=0 T 


For a 3d" configuration, 2/ — 4 yielding values of k — 0, 2, 4 and one has to evaluate 
three Slater integrals Fík) for such a configuration. For an excited state with a core 
hole, like the excited state of a L absorption edge with a configuration 2p?3d"*!, 
it is necessary to take into consideration the Coulomb interaction between the 2p 
and the 3d shells. There are two possible cases for these scattering events (see also 
Chap. 2): 


1. Direct interaction with n; = n3 and n» = n4. This means that each electron scat- 
ters in its shell. The matrix elements read 


min(|2l; ,215|) r* 
FOS y (Ruan Rnae OOF Ran Rn)» 417) 
k=0 7 


2. Exchange interaction with n; = n4 and n = na. This means that electrons 
exchange shells. The matrix elements read 
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min(|2l;,215|) 


k 
/ re 
Go = J (Ra, a, Rnb )| v | 5,5 C) Ry, (r’)) . (4.18) 
k=0 g 


The F® and G™ are called Slater integrals. The magnitude of the multiplet split- 
ting depends on the magnitude of the Slater integrals. For an atomic calculation 
(corresponding to the case of a free ion in spherical symmetry) radial integrals are 
calculated self-consistently using a Hartree-Fock model, with values typically of the 
order of a few eV (not more than tens of eV). In the point group symmetry, where 
the absorber is considered with its environment, these values are reduced empirically 
in order to take into account the effect of the chemical bond which delocalizes the 
electrons. This reduction factor is an adjustable parameter (typically, 60-80% for a 
iono-covalent bond, 100% being the ionic limit case of a free ion). 


The Crystal Field Hamiltonian 


Let us now extend our theoretical framework to include the effect of the CF potential 
on the absorbing ion. This is done by considering the N nearest neighbours as point 
charges (Z;e) at positions R;. The electrostatic potential due to these point charges 
at position r, Vcr (r), is expressed as 


Ze 
Vcr = Y A. (4.19) 


A multipole expansion of the potential can be used to expand (4.19), leading to the 
expression 


oo k 
Vcr( 6,0) =e 35 r 2. Crm, Ø) Qo » (4.20) 
k=0 1=—k 


with Qk m = a 5 | Zi (i MP ee s (8i, di). Ykm is the spherical harmonic, 
Ck,m is the renormalized spherical harmonic, and * is the complex conjugate. Note 
here that one assumes that the radial extent of the 3d orbitals is smaller than the 
distance between the absorbing ion and its first neighbours (i.e., r «& R;). The CF 
Hamiltonian can be developed using the single-particle basis (atomic spin-orbitals 
as discussed before for the Coulomb interaction). Separating the radial and angular 
parts leads to 


Hen = 9 Y Ati (Ya m 011C ml Yu 02)6) 69 « (4.21) 


T1,72 k,m 


Here, the Ak n combine all the radial parts of (4.20). They are related to the usual 
CF parameters (10Dq, Ds, Dt, ...) which are usually not known precisely. The CF 
parameters (or Ax, ) are either fitted parameters or taken from experiments (optical 
absorption, electron paramagnetic resonance, ...). It is important to warn the reader 
against the temptation to fit the calculation with an unreasonable number of CF 
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parameters: with adjustable parameters, one runs the risk of producing some spectra 
in good agreement with experiments not for a theoretically justifiable reason but 
rather due to some lucky cancellation between inappropriate choices of the param- 
eters and inaccurate theoretical approximations employed in solving the model. It 
is therefore important that one uses some sensible limits for these parameters and 
critically examines the values used in the model. 

Equation 4.21 is a useful expression because the matrix elements are integrals 
again over three spherical harmonics and are given by the Gaunt coefficients. Fur- 
thermore, the series can be truncated according to the triangular condition. For 3d 
orbitals (like in the case of an Fe ion), 0; = 42 = 2 so the maximum value of m 
possible is m = 4 with k < 4. The actual form of the matrix elements depends on 
the symmetry of the CF potential. 

We will present the example of an octahedral (O;,) cluster to illustrate the proce- 
dure of calculating the CF matrix elements. In the case of an O, cluster, six neighbours 
are positioned at equal distances from the central ion as shown in Fig. 4.1. We will 
first calculate the multipole terms possible for this configuration according to (4.20). 
This can be easily evaluated and many softwares are available such as the “multi- 
poles" Python package [8]. One finds that the multipole expansion of the octahedral 
potential reduces to the following terms with the coefficients listed below Co,o — 6, 


Cao > P, Caa > C44 > 2. 


Now we can evaluate the matrix elements of (4.21) using the Gaunt coefficients. 
One finds the following matrix for Op, crystal field 


10005 
0-40 0 0 
X Oom lCrml Yom) « ]0 06 0 0]. (4.22) 
k,m 00 0-40 
50001 


Diagonalizing this matrix gives the following eigenvalues E = 6, —4, —4, 6, —4 for 
the eigenvectors 


1 1 
Z — 0 0 0 

0 0 1 0 0 

0], 0 ,10|,.11/,10 (4.23) 
0 0 0 0 1 

1 1 
Z a 0 0 0 


This illustrates the splitting of the 3d one electron orbitals in an O, crystal field as 
shown in Fig. 4.1. The energy difference between the t2, and e, orbitals is referred 
to as 10Dq and its magnitude depends on the radial part Az, m. The five degenerate 
orbitals split into two types of orbital: 


1. Three orbitals of energies —4Dq referred to as the t2, orbitals and which are 
v5 05.2 — Y22), Y2,1, and Y» 1. 


94 H. Elnaggar et al. 


Fig. 4.1 Splitting of the five 
degenerate 3d orbitals (left) 
into fy, and e, orbitals in an 
octahedral crystal field 
(right) 


3d 10D, 


2. Two orbitals of energies 6Dq referred to as the e, orbitals and which are 
5 (Y2,-2 + Y25), and Y20. 


The same procedure can be used to determine the eigenstates and energies of the 
3d (or any other) orbitals embedded in a certain symmetry. 


4.1.3.3 The Many-Body Extended Picture of Electronic States 


The most recently developed approaches aim at extending both DFT and LFM theo- 
ries to provide a comprehensive description of many-body interactions. An example 
for a rather manageable improvement to DFT is to add a static Hubbard parameter U 
to account for on-site electronic repulsion. In addition, there are other approaches that 
go well beyond DFT. They can be grouped into two main types: the quantum chem- 
istry approaches (mainly, wave function-based methods) and the Green's function 
methods. 

Quantum chemistry approaches (DFT-CI, multi-configurational self consistent 
field [configuration interaction (CI)], coupled cluster, quantum Monte Carlo) are 
many-body extended approaches. In DFT-CI, for example, a combination of Slater 
determinants is used to describe the wave function of the system. The number of 
configurations that can be considered is limited by computing power and one has to 
decide which configurations to include. These approaches can only be applied for 
small clusters and molecules due to computational demand. Green's functions based 
methods, such as GW and dynamical mean-field theory (DMFT) provide an alterna- 
tive approach to calculate the electronic structure of strongly correlated materials. In 
a GW method, a screened Coulomb interaction (W) is calculated following a DFT 
calculation of the charge density. A nonlocal energy dependent self-energy operator 
is required. Furthermore, DMFT can be utilized to map the full lattice problem onto 
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a single site quantum impurity problem, using a local screened Coulomb interac- 
tion (U). In both GW and DMFT approaches, correlations (many-body effects) are 
significantly better described than in standard DFT. 


4.1.3.4 Which Approach Works Best for Core Level Spectroscopy 
Calculations? 


Unfortunately this is a very complex question: it depends on the chemical system 
(ionic, covalent, strongly correlated, ...), the edge (K-, L-edges, ...) and the type 
of spectroscopy (absorption, photoemission, ...). Consequently, in the following we 
try to provide some guiding considerations. 

A good starting criterion for the choice of the method of calculation is the local- 
ization of the final state wave function. When electrons are excited into highly delo- 
calized orbitals, they interact much less with other electrons meaning that the intra- 
atomic interactions are expected to be small. In this case single-particle approaches 
often give satisfactory results as the multi-electronic effects are small. This may be 
the case for the K edge of 3d elements and the L» 5 edges of heavy elements (e.g., 
rare earths, 5d transition metals), as well as the K edges of ligands (such as C, N, O, 
S). 

On the other hand, when electrons are excited into localized orbitals, they will 
strongly interact with each other and the core hole. In this case, the multi-electronic 
effects become significant, and description considering only the absorber atom may 
be more successful such as LFM theory. This is typically the case for L» 4 edges of 
3d elements and M4,5 edges of 4 f elements. 

In intermediate cases where both intra- and inter-atomic effects are relevant, like 
the K pre-edge of 3d elements, single-particle and many-body approaches may work 
or fail. It may also happen that different energy ranges of a spectrum are best described 
by different theoretical approaches. This is often the case when a weak pre-edge 
feature is observed before the strong main edge. The pre-edges often arise from 
excitations into localized orbitals while the main edge has a more delocalized char- 
acter. Whether the final state is localized or not may be derived from the shape of 
the absorption edge. When the edge is dominated by a step function, the final state 
can be assumed to be delocalized. If it exhibits distinct peaks that decrease sharply, 
this may be due (but not necessarily) to a localized final state. 


4.1.4 Codes for Ligand-Field Multiplet Calculations 


Regarding the practical aspects of calculating core level spectra, one faces yet another 
level of complexity as the plethora of theoretical methods mentioned above are 
implemented in a comparably large number of computational packages. Instead of 
trying to provide an overview of all the available computational tools, we will focus 
on introducing Quanty, one of the currently available software packages that can 
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be used to perform ligand-field multiplet calculations, and Crispy a graphical user 
interface that uses Quanty as a computational engine. 

Quanty [9-12] is developed by Maurits Haverkort and his collaborators at the 
Institute of Theoretical Physics at Heidelberg University. It is a computational library 
that can be used to write quantum mechanical programs in the second quantization 
formalism. While the library can be used to describe a wide range of problems, 
it is specifically aimed at calculating different spectroscopies, including core level 
spectroscopy. Briefly, the user starts by constructing the Hamiltonian for the system 
of interest, diagonalize it, selects several eigenstates, and then calculates the spectrum 
corresponding to these eigenstates. In Quanty the Hamiltonian is expressed in a basis 
of one particle modes, which can be both fermionic and bosonic. The fermionic modes 
are usually spin-orbitals. In semi-empirical multiplet calculations, the interactions 
between the spin-orbitals are parametrized using values calculated for isolated atoms, 
which are afterwards scaled to account for the effect of the surrounding atoms. 
Alternatively, the parameters can be calculated directly by using for example DFT- 
based methods [9]. After the diagonalization of the Hamiltonian and the selection 
of the lowest eigenstates, using, for example, Boltzmann statistics, Quanty can be 
used to calculate the spectrum. As mentioned previously, this is done using a Green's 
function approach, thereby avoiding the sum-over-states calculation, which can lead 
to an important reduction in computational time. 

The core of the library is written using the C/C++ programming language for 
maximum efficiency. The users do not interact directly with this part of the code, 
but rather with the Lua-based layer that wraps it. To run calculations the users are 
required to write small programs using the functions defined in Quanty. Doing this 
in a scripting language such as Lua has the advantage of providing an ideal environ- 
ment for experimentation, circumventing the limitations of compiled programming 
languages such as C or C++. While this is indeed very helpful, it is not uncommon 
for such programs to reach more than a few tens of lines of code, which in itself 
can be intimidating for the majority of new users. Also, as it is the case for many 
scientific libraries, because of the flexibility given to the users when writing these 
programs, it is impossible to check for all the things that might be incorrect. This 
leads to errors that are difficult to trace even for experienced users. 

To help users to more easily perform Quanty calculations, one of us (M. Rete- 
gan) has developed a friendly user interface that exposes the library's capabilities 
for a large part of core level spectroscopies. Crispy [13] was developed using the 
Python programming language and relies on additional packages from the Python 
ecosystem. The main window of the application is shown in Fig. 4.2. Using Crispy, 
the users can quickly adjust the parameters of the calculation, run it, and plot the 
resulting spectrum without the need of writing any programs. The approach has many 
advantages for novice users, but even experienced ones can use Crispy to generate 
a starting program that will become the basis of their calculation. Crispy is a free 
and open-source program that can be installed on any operating system that has an 
up-to-date Python distribution. For Windows? and macOS® the program comes in 
easy to install packages that can be downloaded from the official website, http:// 
www.esrf.eu/computing/scientific/crispy. 
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Fig. 4.2 Crispy’s main window showing a calculated XAS L» 5 spectrum for a Co?* ion in octa- 
hedral symmetry 


4.2 Spherical Tensor Expansion of the XAS Cross Section 


A spherical tensor is a set of components that transform into each others under 
arbitrary rotations. Another way to state this is to say that the components of a 
spherical tensor generate a vector space which is invariant under rotation. A spherical 
tensor is irreducible if this vector space cannot be written as the sum of two invariant 
(non-zero) subspaces. For an irreducible tensor of rank j, the dimension of the 
corresponding vector space is 2j + 1. For example, spherical harmonics Y; n are the 
spherical tensor components of a spherical tensor of rank /. Note that while a 3 x 3 
matrix is an irreducible Cartesian tensor, it is a reducible spherical tensor which is the 
sum of j — 0, j — l, and j — 2 irreducible spherical tensors. It is evident that such 
an expansion would provide us with deeper insights by identifying groups of spectra 
that obey certain symmetry transformation rules which one could easily relate back 
to the system symmetry [14]. 

Spherical tensor analysis has been used with great success for the X-ray photo- 
electron of localized magnetic systems [15-19] and in XAS [14, 20-22], including 
XNLD [23]. The underlying idea is to determine a finite set of fundamental spectra 
in terms of which all possible experimental spectra can be expressed. More precisely, 
the XAS spectrum obtained for a given polarization vector (€) and wave vector (k) 
of the incident beam is written as a sum of terms which are fundamental spectra [21, 
22] (depending only on the sample properties) multiplied by an angular coefficient 
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depending only on the experimental conditions (k, €). Such a geometric and fully 
decoupled expression is useful: (1) to disentangle the properties of the sample from 
those of the measurement; (ii) to determine specific experimental arrangements aim- 
ing at the observation of specific sample properties; (iii) to provide the most conve- 
nient starting point to investigate the reduction of the number of fundamental spectra 
due to crystal symmetries. 


4.2.1 The Case of Electric Dipole Transitions 


The first step is to build rank one spherical tensors from the vectors appearing in the 
transition operator. The polarization vector e = [€,, €y, €,] can be written as a spher- 


: d 1 — & ié 1_ €x ie, y S 
ical tensor €" with components €_, = Jg epe JE? and €; = €z. Similarly, 


the position spherical tensor, r!, can be constructed. In the following we shall use 
the following notation for the coupling of spherical tensors P^ and Q^ of ranks a 
and b into a spherical tensor of rank c 


a b 
(P* & O°} = Y M (aebflcy) P: O}, (4.24) 


a=—aa=—b 


with (aabB|cy) being the Clebsch-Gordan coefficients. Therefore, 


P*.Q* 2 Y (-D*P*,Qt = (-D*V2a + 1(P* & GQ"). (4.25) 


a=—a 


One has now to compute the scalar product of both tensors which is given by (4.25). 
The dipole transition operator can be written as in (4.26) taking into consideration 
that r is real while € is in general complex 


T = —V3{e! @r'}° 


4.26 
Tt =—V3{e* @ ry. (4.26) 
We can recouple the cross section such that polarization tensors are coupled to each 
other and position tensors are coupled to each other. This means that the expression 
will have a part that depends only on the experimental geometry (polarization vector) 
and a part that depends only on the sample properties. This recoupling can be done 
using the identity 


da dia 
(P* & O°} - (R^ & S" zy jo e xam 


//Qg + D Qd + 1) 
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Here a is constrained to |g — d| <a < g +d. Hence, for the dipole transition, 
g =d = l and 0 <a x 2. The recoupled XAS cross section is finally expressed 
as follows: 


2 
0, = —4raħolm [Lene &ely. eerte ; (4.28) 


a=0 


Quanty can calculate the energy dependent tensors R® = ((I|r! G*r!|I) ^ that 
depend only on the properties of the sample. We will refer to these elements as the 
fundamental spectra. Note that these fundamental spectra are sometimes referred to 
as o ^. This could be confused with the total cross section ca so we shall not use 
this notation here. The experimental geometry tensor is E^ = {e!* Q e!Y^. 


4.2.1.1 Terma — 0 
The first term can be found by substituting a = 0 in (4.28). This is the zero rank of 


the tensor, given in (4.29). 


c (0,0) = 4rahw x if (utrctoG*rCial n  UlrCt aG*rCi al) 


w| = 


+CH G* Can) |; (4.29) 


Here Cem = CE = |j n Y?" . The term o (0, 0) is independent of the incident polar- 


ization vector and as such is rotation invariant. It gives the isotropic contribution of 
the XAS cross section. 


4.2.1.2 Terma=1 


The terma = | consists of three components, namely, o (1, 0), o (1, 1), ando (1, —1): 


1 
o(1,0) = —4raho x DEC — ie) 


x (Wich trey ald) = (rch 6*r6 an) . (4.0) 
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c(1,1) = —4raħw x imf €; — EE + ie,€7 — iei«) (4.31) 


—— = We 
2/2\* 
(UirCfoGt rc ald) + itirct, eren) (4.32) 


1 
o(1,—-1) 2 —4raħw x in| (ee — Exe ie eg = iee) 


24/2 
x (urctoG*re. i Da trct.G*reun)| . (433) 


One notices from (4.30), (4.32) and (4.33) that the spectra of a — 1 are not active if 
any of these two cases are satisfied: 


1. If linearly polarized light is used for the measurement. 
2. If all off-diagonal elements are zero and the diagonal elements are equal. 


Another conclusion that can be drawn from the recoupling is the necessity to perform 
XAS measurements using both linearly and circularly polarized light to probe these 
fundamental spectra. 


42.1.3 Terma =2 


The term a = 2 consists of five components, namely, o (2, 0), o (2, 1), o (2, —1), 
o (2, 2), and o (2, —2). These are given below as follows: 


1 
22,0) = —Amat x Im] (2? le? ley P) (2UIrChoG*rCrol) 


—-(rCtz ,G*rCi. aD) — trct,G*reun)| ; (4.34) 


1 
o(2,1) 2 —4raħw x im] (se + ee, — ieee — iej«) 
342^ * TRUM 


x (utret,.,6*reiuoln = treten) . (435) 


1 
c(2, —1) = —4raħw x m (se + ee, tie*e, tie «) 
Bai S ? iin 


x (utrcto atre ail) — rct ,G*reuin)| , (4.36) 
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o(2,2) = —4raho x in| > ; («. - icy (ee — ie) 


«(iret 1G*re.,n) | (4.37) 


o(2, —2) = -raho x In| 7 (tioii) 
x (uret.otres un) (4.38) 


It can be noted from (4.34), (4.35), (4.36), (4.37), and (4.38) that the a — 2 spectra 
are active for linearly polarized light and hence these spectra are responsible for the 
angular dependence observed with linear light. On the contrary, no difference can be 
Observed between right and left circularly polarized light. Another feature of these 
terms is that they are not active if the following two conditions are satisfied: 


1. The diagonal matrix elements are equal. 
2. The off-diagonal matrix elements are zero. 


4.2.1.4 General Dipole Expression 


The general dipole expression is given in (4.39). From this equation, the dipole 
XAS cross-section for an arbitrary polarization (€) can be constructed from the nine 
fundamental spectra derived above. 


1 1 
arise) = —Adrahw x Im Exc 0+- 5 (ies ey — ie,€ >) RA, 0) 
1 
24/2 
1 


(ete; — ef + ieye® — ie;e) RU, 1) 
+—— (eže; — €,€7 + ieve, — ieye*) RU, —1) 
2/2 ( 2 [4 y y Bl 
1 
+ d le. le,?) RQ,0) 


(exe? + eže, — ie; — ice) R(2, 1) 


E 


E (exe? + eže, + iefe, +ieyež) RO, —1) 
ES | (e. — ie)(et — ie) R(2, 2) 


1 
= (Cex Hie) ie) RQ, -»| ; (4.39) 
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where the R are the fundamental spectra and are defined as 


R(0, 0) = UlrCroG ^ rCiol) + (rch 46'76, lp 


+r Ci G*rCi all), (4.40) 

R(,0) = (rC GrG lZ) — (IrCC GrG aM), (4.41) 

RQ, 1) = UlrCt9G*rCialD)  lrCt 4G*rCiollD , (4.42) 

R(1, —1) = (IrC}oGtrCi aM) + Ir Ce G*rCy oll) , (4.43) 
R(2, 0) = 2U|rCt9G*rCiolD) — (IrC}_GĦrCi l) 

-ülrCt Gtr all) , (4.44) 

R(2, 1) = (rC ,G*rCiolJ) — Ur Ce oGtrCi al), (4.45) 

R(2, —1) = U|rCt4G^rCi aM) — Ur Ce G*rCy oll), (4.46) 

R, 2) = (I|rCf_,G*rCi ald) , (4.47) 

R(2, —2) = (I|rC¥ .G*rCi aD. (4.48) 


4.2.1.5 Case Study of a d? ion 
Octahedral Crystal Field 


Equation (4.39) can be simplified when the symmetry of the absorbing system is 
taken into account. As a demonstration, we shall study a d ? ion in octahedral (O;,) 
symmetry. Figure 4.3 (top) shows the matrix elements for such a system. These 
matrix elements are the direct output of Quanty and will be referred to as the con- 
ductivity tensor. One finds that all the off-diagonal matrix elements are equal to zero 
and all the diagonal matrix elements are equal. This leaves only the R(0, 0) term of 
(4.39) not equal to zero. Hence one can conclude that the cross section of a dipole 
transition is isotropic in an O; system. 


Tetragonal Crystal Field 


Let us now consider a tetragonal distortion such that the octahedron is compressed 
along the z-axis. The ground state in this case has a hole in the d,2 orbital (neglecting 
spin-orbit coupling) and the z-axis is now different from the x- and y-axes. The 
conductivity tensor for such a system is shown in Fig. 4.3 (bottom). As could be 
intuitively expected, the middle panel corresponding to C}G* C, is different from 
the other two diagonal elements. This implies that the following terms come into 
play (see Fig. 4.4): 


e R(0,0) which gives the isotropic cross section. 
e R(2,0) which has a polarization dependence of the form i (21e: |? — je, |? — ley I”). 


It is interesting in this case to investigate what types of dichroism effect could be 
observed. Consider rotating the incident linear polarization vector in the x — y-plane. 
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Fig. 4.3 Conductivity tensor calculated for a d? ion in an octahedral crystal field (top) and in a 
tetragonal crystal field (bottom). Calculations are done with a crystal field parameter 10D, = 1.1eV 
and with D, — —0.2eV in tetragonal symmetry. Re and Im are the real and imaginary parts of the 


tensor 
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Fig. 4.4 Fundamental spectra R(0, 0) and R(2, 0) for a dipole transition for a d? ionina tetragonal 
crystal field (10D, = 1.1 eV and D, = —0.2 eV) 


In this case € = [cos(0), sin(0), 0] where 0 is the rotation angle defined from the 
x-axis. The expression of the polarization dependence for the term R(2, 0) reveals 
that no angular dependence is to be expected in this case. The system is effectively 
On in the x — y-plane and one would expect no angular dependence as discussed in 
the previous example. This XAS cross section for this rotation is shown in Fig. 4.5a. 
On the contrary, rotating the polarization vector in the x — z (e = [cos(0), 0, sin(0)]) 
Or y — z (e — [0, cos(0), sin(0)]) planes should yield an angular dependence as the 
polarization vector probes the distortion, which is indeed observed as shown in 
Fig. 4.5b and c. The dependence of the XAS cross-section on the direction of the 
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Fig. 4.5 Angular dependence of the L2, 4 XAS of a3d? ion ina tetragonal crystal field. Calculations 
are done by rotating the polarization vector in the x — y-plane [panel (a)], x — z-plane [panel (b)], 
and y — z-plane [panel (c)] as illustrated in the top panel 
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linearly polarized light is an effect referred to as linear dichroism as discussed pre- 
viously. 


Octahedral Crystal Field with an Exchange Field || z 


Another interesting system to investigate is a magnetic 3d? ion where the crystal 
field is O, with an exchange field aligned along the z-axis. Hence, the z-axis is 
inequivalent to the x- and y-axes due to the exchange field. The conductivity tensor 
of such a system is shown in Fig. 4.6. The exchange field is aligned along a high 
symmetry direction in this example which preserves the C4 rotation symmetry of 
the system and consequently preserves the symmetry of the conductivity tensor. All 
off-diagonal elements are zero. Note that the off-diagonal elements are zero because 
we chose to calculate the tensor using the symmetry adapted transition operators. 
Three fundamental spectra come into play and are plotted in Fig. 4.7: 


e R(0,0) which gives the isotropic cross section, 
e R(1,0) which has a polarization dependence of the form ; (i €xéy — i &e) j 
e R(2, 0) which has a polarization dependence of the form 7 QIe« P? — le? — lel?) à 


Nearly no angular dependence can be observed by rotating the incident linear 
polarization vector in the x — y-, x — z-, and y — z-planes (see Fig. 4.8a, b, and c). 
This is consistent with the fact that the fundamental spectrum R(2, 0) responsible 
for the angular dependence is nearly zero [ R(2, 0) is about two orders of magnitude 
smaller than the other two fundamental spectra in this system]. The difference in the 
absorption cross section of linear polarized light in a magnetic system is an effect 
referred to as XMLD [24]. The magnitude of the XMLD effect for this system can 
be seen in Fig. 4.9. Note that the magnitude of the XMLD effect in Fe3O, is ~1% 
of the XAS signal, which could be reliably measured on existing beamlines [25]. 

A strong dichroism is observed when circularly polarized light is used as in the 
case for Fig. 4.10a. Here the incident polarization vector is either left or right polarized 
about the z-axis leading to a difference in the absorption. This is an effect referred to 
as X-ray magnetic circular dichroism (XMCD) [26]. It can be seen from the expres- 
sion of the polarization part of the cross-section that if the incident wave vector is 
aligned perpendicular to the exchange field, for example, for e — [0, — E Zl and 


€ = [0, — RI — Zl no XMCD effect is observed. This is shown in Fig. 4.10b and c. 
This dichroism can be used to quantify the ground state spin and orbital moments of 


the system as given by the sum rules [27]. 
Octahedral Crystal Field with an Exchange Field || [210] 


As a last example, we consider a system in Cı symmetry. Consider aligning the 
exchange field along a low symmetry direction, e.g., [210]. The exchange field now 
completely breaks the symmetry of the system and the conductivity tensor has off- 
diagonal elements (bottom of Fig. 4.6). Contrary to the previous case (where the 
exchange field was aligned to the z-axis), it is now not possible to find a rotated 
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Fig. 4.6 Conductivity tensor calculated for a d? ion in an octahedral crystal field (10D, = I.1eV) 
with an exchange field (B — 0.05 eV) along the z-axis (top) and along the [210] direction (bottom). 
The calculations are performed using a real spherical harmonics basis set 
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Fig. 4.7 Fundamental spectra R(0, 0), R(1, 0), and R(2, 0) for a dipole transition for a d? ion in 
an octahedral crystal field (10D; = 1.1eV) with an exchange field (B = 0.05 eV) aligned along 
the z-axis of the cluster 
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Fig. 4.8 Angular behaviour of the L2,3 XAS of a 3d? ion in an octahedral crystal field (10D, — 
1.1 eV) with the exchange field (B. = 0.05 eV) aligned along the z-axis. The calculations are done 
by rotating the polarization vector in the a x — y-plane, b x — z-plane and c y — z-plane as depicted 
in the top panel 


basis set that diagonalizes the conductivity tensor for all excited states (1.e., the basis 
set becomes energy dependent). It remains possible to diagonalize the conductivity 
tensor for a given excited state. It is important to realize that when the exchange field 
is aligned along a low symmetry direction, off-diagonal elements become important 
and more dichroic effects come into play according to (4.39) and (4.48). 


4.2.2 The Case of Electric Quadrupole Transitions 


For electric quadrupole transitions, we will follow the same procedure as the one 
used for electric dipole. The transition operator is now (up to a factor of i/2) T = 
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Fig.4.9 X-ray magnetic linear dichroism of a 3d? ion in an octahedral crystal field (10D, = 1.1eV) 
with the exchange field (B; = 0.05 eV) aligned along the z-axis. The dichroism is computed by 
subtracting the XAS: a with € || x from that with € || y, b with € || x from that with e || z and c with 
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Fig. 4.10 X-ray magnetic circular dichroism of a 3d? ion in an octahedral surrounding with the 
exchange field (Bz) aligned along the z-axis. The dichroism is computed by subtracting the XAS 
signal calculated with right circularly polarized light from that with left polarized light. a The 
incident wave vector is aligned parallel to the z-axis. b The incident wave vector is aligned parallel 
to the y-axis. c The incident wave vector is aligned parallel to the x-axis 


(€ -r)(k.r). Itcan be seen from the expression of the transition operator that the cross 
section will depend on the orientation of the polarization vector (€) and of the wave 
vector (k) with respect to the absorbing system. Two recoupling steps are required in 
this case. First, the transition operator can be rewritten into a combination of scalar 
products of two tensors: one tensor that depends only on € and k coupled together, 
and one tensor that depends only on the absorber r. This recoupled transition operator 
is expressed as follows: 
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2 
f= AOG @ k!}{r! 9 ry 
b=0 
M 2 
T! =X ciy(e*eky. ror. 


c=0 


(4.49) 


The next step is to recouple the two transition amplitudes of the absorption cross 
section. This gives the expression 


4 2 2 
Oy = Wah x in| 3: M Cyt’ fle"! @ kY g fe! err 


a=0 b=0 c=0 


x(ujtr! erlyc*tr!o vine | . (4.50) 


Before attempting to write out the recoupled absorption cross section in (4.50), it 
is useful to simplify the expression of the transition operator first. This in turn will 
simplify the expression of the absorption cross section. The transition operator is a 
rank two tensor according to (4.49) with b = 0, 1, 2. We shall write out the three b 
terms: 


e Term b=0 
cite! exl ora (eiki + det, + eel kt) 
x (- rini + Air + Aran) (4.51) 
The first part of the expression can be rewritten as 5 ( Ezk; — eyk, ek). This 


is equal to zero because the polarization vector is orthogonal to the wave vector. 
This means that the term b = 0 is zero. 


Term b =1 
This term consists of three components according to 


CDe! gk} {ri erly = “Bw 


The second part of the expression is equal to zero because it is a cross product of 
the same vector. This means that the term b = | is also zero. 


(e x k)-(r xr). (4.52) 


Term b = 2 

This term consists of five components. These five components can be simpli- 
fied applying the orthogonality between € and k. In addition the r tensor can 
be expressed in terms of spherical harmonics of / — 2 according to the relation 
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ir cir =r} = ,/ S. Yo (r) 2 J Er Y, (0, p). One obtains the following 
five components after simplification 


fl @k yr @rlp = (5. (rô) , (4.53) 


ll Ok Piri er, = ( Imo 5 Wn =) (ri) . (4.54) 
(eek? ert (AS — ; G wa) (1). (4.55) 
f'@k Br er? - (S + ee t =) (r22) . (4.56) 
eorr er} = (= = ees 2 =) (r2) - man 


The same arguments apply for the c — 0, 1, 2 terms of the recoupled T! operator 
ending up with the values b — 2 and c — 2 for the XAS cross section. The recoupled 
Cross section writes 


4 
Ow = n aho? x if YODU S kP oteoxyy 


a=0 


xtultr! er!Pc*(r!o "rir | . (4.58) 


Now one can develop (4.58) in further details for a = 0, 1, 2, 3, 4. We shall only 
report the final expression here. 


42.3 Terma — 0 


Let us substitute a = Oin (4.58). This is the zero rank of the tensor, o (0, 0), describing 
an isotropic spectrum 


1 
o (0,0) = m?aħwk? x Im EST + Ulr^ c; ,G*r^C, lI) 


TUI? C$, G*r? CoD) + Ul? CZ, 48* r^ Co UD) 


&neci6*rcsan)| (4.59) 
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42.4 Terma=1 


The term a = 1 consists of three components, namely, o (1, 0), o (1, 1) ando (1, —1) 


o (1,0) = z?ahok? x in| x (o + 2K? + k2e,e; — (2k? + 2k? + kDete, 
ihkad- Ese bee = kykeeye*) 
x (UIP CE Gt? sun - ülr?c3 Gtr? Cy ag) 


-2Ulr^C$,G* r^ Cr 211) = x6, 6*6. 4) : (4.60) 


c (1, 1) = z2aoliok? x in| 7 (o +k + 2K? — ikxky)ežez 
— (2k + ko + 22 — ikxky)exež — i(k? + 22 + 2k? ikcky)ete: 
ti + 2k2 + 2k2 + ikxky)eyež + (ky + ikx)kz€žey — i (kx — iky)kcex€} ) 
x(vuir?c$ 1G+r?C2 on) veut? ci Gtr? Co, 110) 


420463. Gra, D) + 201 C$ o*?ean)] : (4.61) 


c (1, —1) = z2ofiok? x LE (o + k? + 2k? + ikxky)ežez 
2 72 ; * L2 2 2_. * 

— (KE + KZ + 22 + ikcky)exe? + i(k? + 22 22 — ikxky)ež ez 

-i (KR + 2K? + 212 — ikxky)eyež + kelky — ikx)ežey + ikz(kx + iky)exeš ) 

x (VOIP CF a*igyn veut? es 8*6» n 

X2ulr?c3 Gtr? Cz lI) + 201 C$_1G*2C,-211))| (4.62) 
A quick check of (4.60), (4.61) and (4.62) reveals that the term a = 1 is zero for 
linear light. This implies that these fundamental spectra can only be probed with 


circular or elliptically polarized light. It is also clear that if the conductivity tensor 
has no off-diagonal terms, and satisfies 


uie cor neuro Croan, 
uj cio r Gall = UP CE 6*7€233; 


then the term a — 1 is again zero. 
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42.5  Terma=2 


The term a = 2 consists of five components, namely, o (2, 0), o (2, 1), o (2, —1), 
o (2, 2), and o (2, —2) 


o (2,0) = z?ahok? x in| sz (atte — 2krky€y + krkz€: + 62e, — 3k2e,)ez 
-H(6k2e, — 2k, kye, + Ak;e, + k,k.e. — 3k2e,)e; 
t (-3Ee,  kkze, — 31e, + kk,e, — Sie)et) 
x (2ut?c;,6*r 63D + 2(1|r°C3_,Gtr? Co |) 
-2U|r^C2,96G* r^ Ca ol1) — Ur? C$,G* r^ C540) 


-uec; 6c. un] (4.63) 
2 2 =l ; ; 
o(2, 1) = 22ahok? x tm] s (fete. — 6ikyey + ikyey — kyey)k; 


QE + ikxky + 3k; + 2k? )e,]e* — i[(—kxex — ikyex + 6ikxey + Akyey)kz 
tK — ikyky + 202 + k29e;]ey + [2k2 (Ex — ie) + kj Bex — 2iey) 
oer — 3iey) — 4ikykz€z + ky (ikyey — kyey + kze) 

x (Veutr?c$,6*r?c n - VOU? C$_ Gtr? ol) 


«617? c$ ,G*r?c5510) — uei. 6*0 un)] (4.64) 


_ 2 2 1 ; 2 2 2 
c (2, —1) = z^ahok^ x In 168 [6ikykzex — kykzey + 2kyez + 3kye; + 2k7e; 
—iky (Aikzey + kzey + kyez) et +i — kxkz(ex + 6iey) + kykz (iex + 4ey) 
+3kŽez + ikkyez + 202 an kz*)ez |e + [2k2 (ex + iey) + ke. + 2iey) 
+k? Qe, + 3iey) + Aikykze; + ky(—ikyey — kyey + kze |e?) 
x (VOU? C$ Gt 2,111) = Veutr?c$ 6*6» 011) 


+6(1|r?C3_,G*r?Cy,_9|1) — en?ci,c*?eun)| ; (4.65) 
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1 ; 
c (2,2) = n?aliok? x m — 2(k, — iky)(ez — ie) ke + ke — 2k,e;) 
—3(& (ky — iky) + k (e; — ie) (ek, — iky) + ket — ik;e;) 


2(k, — ikyY(ex — ie) set + et — 2k-€*)) 
2 2 ^x T2 3 2 ^x +,.2 
x( Z(CZ o*r? Cal) + yf 5 UI C$ ,G*r Cual) 


2 
E iei ene un)| (4.66) 


1 
o (2, —2) = z?ahk? x m > ( — 2(ky + iky) (EF + ies) (kxex + kyey — 2k;e) 
4/21 y y yy z-I 


—3(e(k, + iy) + K (es + ies) (es + ky) + et + ikset) 
2k, + iky) (e, + ie) (ket + ket — 2k-€*)) 


2 3 
«(JSumessetie, an + [Bucs etree ln 


2 
sarez a*czan)] , (4.67) 


4.2.6 Terma=3 


The term a = 3 consists of seven components, namely, o (3, 0), o (3, 1), o (3, —1), 
o (3, 2), o (3, —2), o (3, 3), and co Q, —3) 


i 


c (3,0) = z2aliok? x in| xc (a? + ky -2K)ee — (TE -2kDe€ 
+ kirk eet — 2k ks eset + 2 eset — 2kykzexe”) 
x (2ui?c; 6*0, n - 2ulr^ C5, Gr’ C») 


+(I|r?C3,G*r°Co2|1) — uec; 6*6 an) ; (4.68) 
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1 
0/6 
+8kz (ky + ils )exey — (K? + 2ikxky + 3k — Ak2)exe; — 8k (ky + iky ever 


c (3, 1) = n ahok? x ml- (ick — 2ikxky + ky — 4k2)eze* 

*T2 5 ; 2 ; 2 x 
Fe [ky (ex — 3iey) + 2ikxky (ex + iey) + ky Bex — iey) — Akz (ex — iey)}) 
x (2? C§ 6*0 un + 2|r?C3 .16*72c, ol) 


—eulr?c5 ,G*r?c5 511) — Véuv?es 6*0 n)] (4.69) 


1 
c , —1) = zZaliok? x 3 
404/6 


(IGK zik k, +K? — MD; 


Ak2)ete, + 8k (k, — ikx)ežey 


+8ikz (ky + iky)esey + (k? — 2ikxky + 3k 
SRCE 3i6)) 2k. (Ey + ier) — Ger + iey) + 42 (6s ie] 


x(2ut?ci,6*r*e, n 2017 C$ Gtr? C»olT) 


—V6(I|r?C3_ ,G* r^ C», 5D) — V&utcs,6*csun)| ; (4.70) 


1 
c (3,2) = x Zahok? x im| (ks tt ik, (eie, cb + be) 


4/6 


teke + ike, + 2ke) — 2ik.e' (e, — ie) 


x (utc, 6*r can = uec; .6*°C011) "E V 


1 
o(3, —2) = nZahok! x imf (ix + kx) GUS — k,e, + 2k,€,) 
4/6 y y y-y on 
tet (-ik,e, + hye + 2ike;) — 2k.et (e, + ie) 


x (ut?c$,6* rcu = (IWC 9647 Cs, an)| , (4.72) 


1 
c (3, 3) = mahħok? x DHC - iky)”) (= ež (ex — ie) + (et — ie) 


x( - (Ilr°C3 ,G* r^ CoD — mec; *eun)| . (4.73) 


4 X-ray Dichroisms in Spherical Tensor and Green's Function Formalism 115 
2 2 l . 2 * ; * E 
c G, —3) = n?aħok? x Im a (t 4 ik) \(- € (e + ie) - eet + ie) 


x (ui?c$, 6*6 n + U1?C36*r?C>)| (474) 


4.2.7 Terma=4 


The term a = 4 consists of nine components, namely, o (4, 0), o (4, 1), o (4, —1), 
o (4,2), o (4, 2), 0 (4, 3), o (4, —3), o (4, 4), and o (4, —4) 


o (4, 0) = r?ahok? x in| zc (bte + Ile, — Ae, + 2k, ke — Bk. ke.]e] 
+key + 3k?e, — Ak2e, + 2k. ke, — 8kyk.e.]e; 
-4et[e (kK + K? — 212) + 2k, ses + kyey)1) 
x (eut? c;,6*r Cool) — AU|r? C3, G*r? Cod) 
—-4AUpr?^C$, ,G*r^ €» aD) + Ur? C$46* r^ C236) 
uic; 46*r6, 410) (4.75) 
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42k; (kcex +ikxey + ikyex + 3kyey)] + ež [2 Gex — iey) 
+2kxky (€y — iex) — 8kxkzez + Ke — 3iey) + Bikykzez — 4k? (€x — iey)]) 
x (V6? C$ ya * rhe» n - Jéulr?c$. ,G*r?Co.ol) 


uj?c$ 468*60 6, D — mec; ,o*?c.an)| (4.76) 
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o (4, —1) = z?2ohok? x in|] c (« [e GK2 + 2ik,k, +k? — 4i) 

42k; key + ikcey + ikyex + kyey)] + ey [ie (2 — 2ik ky + 3K2 — 4k2) 
2k; (ik,€y + kcey + kyex + 3ikyey)] + e? [k] (Bex + ie) 
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x (Veu?ciu6*rhc n + V6 |r2C% 6*r* Col) 
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= 
GE By CORE & DEC = iky) (« - ie (ek, — iky) 
2k. (e, — i€y)) + et, — ik) (e, — ie) 


x (ui?es 26+? C111) = (C3 e*nesan)| . (4.80) 
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1 
o (4, —3) = n?olok? x ims (s + iky) (Cet ie ds + iky) 
+2K (€x + i€y)) + €? (kx * iky)(ex + ie) 


x (uir?ci,6*r*c, in - iniec;.e*res un) | | (4.81) 


1 
c (4,4) = n Zahok? x DEC =i) ie le = ie) 


x (uie; een) (4.82) 


1 
o (4, —4) = ahok? x Im[ (t. tik) (e, ie) (et ie) 


x(unee:,6*e un) | (4.83) 


In the most general case, the quadrupole XAS signal can be described using 25 
fundamental spectra as given in (4.59)-(4.83). Although the expression seems at 
first sight complicated, major simplifications and intuitive conclusions can be made 
when one considers the symmetry of the absorbing system. We shall illustrate this 
in the following section. 


4.2.7.1 Case Study of a d? Ion 


As an example, we will study again a d? ion in different local symmetries. 


Spherical Symmetry 


We shall start with an isolated d? ion (i.e., spherical symmetry). The conductivity 
tensor of such an ion is shown in Fig. 4.11. The tensor consists of 25 elements that 
form the 25 fundamental spectra through appropriate linear combinations. Only the 
five diagonal elements are non-zero in this case and are all equal. This means that the 
only possibly active fundamental spectra are of the type o (a, 0) witha = 0, 1, 2, 3, 4. 
However, because all the diagonal elements are equal, only the fundamental spec- 
trum o (0, 0) is non-zero. This fundamental spectrum has no angular dependence, 
hence this system is isotropic. It is not a surprising result that for a spherical system, 
no angular dependence would be observed. 
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Fig. 4.11 Conductivity tensor calculated for a d? ion in spherical symmetry. Re and Im are the real 
and imaginary parts of the tensor 


Octahedral Crystal Field 


We shall examine next a d? ion in O, symmetry. The conductivity tensor of this ion 
is shown in Fig. 4.12. Several differences can be directly seen in comparison with 
the previous case: 


e The elements with the transition operator T = C?, are mixed with those of 
Te. 
e The diagonal elements are not equal. 


Let us consider the first point. This mixing leads to the same form of eigenvectors 
than for obtained for the 3d orbitals (Y2 m) for an O, crystal field [see (4.23)]. Indeed, 
this is exactly the same problem. In order to obtain only diagonal elements, one can 
apply the following rotation (4.84): 
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Octahedral crystal field 
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Fig. 4.12 Conductivity tensor of a d? ion in an octahedral crystal field (10D, = 1.1 eV). Re and 
Im are the real and imaginary parts of the tensor 


1 1 
53000 % 
0010 0 
Rot=|9 305 0 (4.84) 
0550-5 1 
i 2 v2 i 
4000 -5 


The rotated conductivity tensor is shown in Fig. 4.13. Only diagonal elements 
exist now. These are separated into two types: three (/5,) that are equal with transition 
operators, C2,, C?. and C?., and two (eg) that are equal with transition operators, 
C2, and ES 
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Octahedral crystal field in symmetry adapted basis 
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Fig.4.13 Conductivity tensor fora d? ion in an octahedral crystal field (10D, — 1.1 eV)calculated 
using the symmetry adapted basis set 


Five fundamental spectra come into play, namely, o (0, 0), 0 (2, 0), o (4, 0), 
o (4, 4), and o (4, —4) as shown in Fig. 4.14. The O, symmetry implies that R(2, 0) is 
always equal to zero as confirmed by the calculation. In addition, R(4, 4) — R(4, —4) 
and are proportional to R(4, 0) as can be seen from (4.75), (4.82), and (4.83). There- 
fore, as can be expected from group theory, only two fundamental spectra are required 
to fully describe the system. 

Let us investigate the angular dependence of a quadrupole transition in an Op 
crystal field considering two scattering geometries. In the first geometry, the wave 
vector (k) is aligned parallel to the [100] direction and the polarization (€) is rotated 
inthe z — y-plane as illustrated in the right panel of Fig. 4.16. Despite the presence of 
non-isotropic fundamental spectra [o (4, 0), o (4, 4), and o (4, —4)], the XAS cross- 
section is constant in these settings as shown in Fig. 4.16a. In the second geometry we 
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Fig. 4.14 Fundamental spectra for a quadrupole transition in a 3d? ion in an octahedral crystal 
field. Re and Im are the real and imaginary parts of the spectra 


have k || [5 50] and e is rotated about the [55 3501 axis as depicted in Fig. 4. 16b. 
The XAS cross section shows a clear twofold angular dependence in these settings. 

It is interesting to discuss the difference between both scattering geometries and 
the reason behind the absence of angular dependence in the first case. In Fig. 4.15, 
we plot the angular dependence of the light tensor [E (4, 0), E(4, 4), and E(4, —4)] 
for both cases. The contribution of the term E (4, 0) is 90° out-of-phase with respect 
to the terms E(4, 4) and E(4, —4) for the first scattering geometry [see panel (a) 


of Fig. 4.15]. The O;, symmetry implies that the ratio between the R(4, 0) and the 
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Fig. 4.15 Angular dependence of E(4, 0), E(4, 4), and E(4, —4) terms. a The wave vector (k) 
is aligned with [100]. The angular dependence is computed by rotating the polarization (e) about 


[100] with 0 = 0° for e || [001]. b k is aligned with [75 7501. The angular dependence is computed 
by rotating € about [75 z with 0 = 0° for e || [001] 


R(4, +4) terms leads to a constant XAS cross section. On the other hand, as depicted 
in Fig. 4.15b, all terms are in phase which leads to an angular dependent XAS. 

An important distinction between the dipole and quadrupole transitions can be 
concluded from these examples. While a dipole transition in an O, system exhibits 
no angular dependence, the quadrupole transition can show angular dependences 
when the scattering geometry is appropriately chosen. This difference holds because 
a quadrupole transition has higher multipole contributions that give rise to angular 
dependences not observable for dipole transitions. 


Tetragonal Crystal Field 


The effect of reducing the crystal field symmetry to tetragonal by applying a com- 
pressive distortion along the z-axis can be directly seen in the angular dependence of 
the quadrupole transition. In contrast to the case of O, crystal field (see Fig. 4.162), 
now rotating € about the [100] axis shows angular dependence because the z- and 
y-axes are not equivalent (see Fig. 4.17a). However, as could be expected, rotating € 
about the [001] axis shows no angular dependence (see Fig. 4.17b). In this projection, 
the system is effective of O, symmetry. 
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Fig. 4.16 Angular dependence of XAS for a quadrupole transition in a 3d? Op ion with 10D, — 
1.1eV. a The wave vector (k) is aligned with [100]. The angular dependence is computed by 
rotating the polarization vector (€) about [100] with 0 — 0? for e || [001] as depicted in the sketch 
on the right. b k is aligned with [5 2501. The angular dependence is computed by rotating e about 
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[55 ,;9l with 6 = 0° for e || [001] as depicted in the sketch on the right 


Octahedral Crystal Field with Exchange Field || z 


Consider a magnetic 3d? ion where the crystal field is O, with an exchange field 
aligned along the z-axis. Seven fundamental spectra come into play, namely, R(0, 0), 
R(1, 0), R(2, 0), RG, 0), R(4, 0), R(4, —4), and R(4, 4). We have shown previously 
that for Op symmetry, when k is aligned parallel to the [100] direction, and e is rotated 
in the z — y-plane, angular dependence is observed (see Fig. 4.16a). Repeating the 
same calculation with an exchange field aligned along the z-axis leads to an angular 
dependent XAS as shown in Fig. 4.18a. The exchange field reduces the symmetry 
along the z-axis. The effects of rotating the incident linear polarization in the z — y- 
plane on the fundamental cross sections are shown in Fig. 4.19. Only the terms 
o (2, 0), o (4, 0), c (4, 4), and o (4, —4) are non-zero and exhibit a twofold angular 
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Fig. 4.17 Angular dependence of XAS for a quadrupole transition in a 3d? ion in a Da; crystal 
field (10D, = 1.1 eV and D, = —0.2 eV). a The wave vector (k) is aligned with [100]. The angular 
dependence is computed by rotating the polarization (€) about [100] with 9 = 0° for e || [001]. b 
k is aligned with [001] and e is rotated about the [001] with 0 = 0° for e || [100] 


dependence. However, one notes that o (4, 0) is 90? shifted with respect to o (4, 4) 
and o (4, —4) which implies that the angular dependence of the XAS will be small. In 
comparison, no angular dependence is observed when € is rotated in the x — y-plane 
as shown in Fig. 4.18b. 

Finally, the exchange field can give rise to interesting combinations of structural 
and magnetic dichroism effects. Consider aligning k || [001] and measuring XAS 
using circular polarized light. Rotating the system about the [100] axis gives rise to 
unconventional angular dependent XAS as shown in Fig. 4.20. This angular depen- 
dence arises from a combination of structural and magnetic dichroism effects. 

The magnetic contribution arises from the circular dichroism active terms which 
are o (1, 0) ando (3, 0) (see Fig. 4.21a). On the other hand, the structural contribution 
arises from the linear dichroism active terms which are o (4, 0), o (4, 4), and o (4, —4) 
(see Fig. 4.21b). In addition, these terms contribute weakly to the magnetic dichroism. 
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Fig.4.18 Angular dependence of a quadrupole transition for a 3d? ion in Op crystal field (10D, — 
1.1eV) with an exchange field aligned along the z-axis (B = 0.05eV). a The wave vector (k) is 
aligned with [100]. The angular dependence is computed by rotating the polarization vector (€) 
about [100] with 6 = 0° for e || [001]. b & is aligned with [001] and € is rotated about the [001] 
with 0 = 0° for e || [100] 


This is illustrated in Fig. 4.22 where in panel (a) we show the structural dichroism 
signal for the same system without an exchange field and in panel (b) the difference 
between the case with exchange versus without exchange. The magnetic contribution 
is about three orders of magnitude less than the structural contribution for this system. 
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Fig. 4.19 Angular dependence of XAS fundamental cross sections o(1,0), o (2, 0), o (3,0), 
o (4, 0), o (4, 4), and c (4, —4). Here k is aligned with [100] and € is rotated about [100] with 
0 = 0° for e || [001] 
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Fig. 4.20 Angular dependence of XAS for a quadrupole transition in a 34? ion in an Op, crystal 
field (10D; = 1.1 eV) with an exchange field aligned along the z-axis (B = 0.05 eV). The wave 
vector (k) is initially aligned to [001] and polarization (€) is circular. The angular dependence is 
computed by rotating the system about the [100] axis 
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Fig. 4.21 Angular dependence of XAS fundamental spectra for a quadrupole transition in a 3d? 
ion in an Op crystal field with an exchange field aligned along the z-axis. a Circular dichroism 
active terms o (1, 0) and c (3, 0). b Linear dichroism active terms o (4, 0), o (4, 4) and o (4, —4). 
The angular dependence is computed by rotating the system about the [100] axis 
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Fig. 4.22 Angular dependence of XAS for a quadrupole transition in a 3d? ion: a in an Oj crystal 
field (10D, = 1.1 eV). b The difference between calculation (a) and in an Op crystal field with an 
exchange field aligned along the z-axis (10D, = 1.1 eV and B = 0.05 eV). The wave vector (k) 
is initially aligned to [001] and polarization (€) is circularly polarized. The angular dependence is 
computed by rotating the system about the [100] axis 
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4.3 Conclusion 


We have expressed the XAS cross section using Green's function formalism and 
spherical tensors, which allows a tractable investigation of the different types of 
dichroism: on the one hand experimentally, by allowing the experimentalist to pre- 
dict the smallest set of measurements required to recover the full spectroscopic 
information and therefore to optimize beamtime; on the other hand theoretically, 
using modern core level spectroscopy codes, by allowing a more efficient calcula- 
tion rather than a point-by-point time-consuming treatment. In a forthcoming work, 
we intend to use a similar approach for resonant inelastic X-ray scattering (RIXS) 
spectroscopy, whose richness lies in the large number of possible spectra that can 
be obtained by varying the energy, direction, and polarization state of the incident 
and scattered beams [28]. But as a matter of fact, there is so much information in 
the spectra that it is difficult to know whether a specific set of experiments measures 
all potential information. In our opinion, the possibilities offered by angular and 
polarization dependent RIXS measurements have not yet been exploited to the best 
of their potential. Nevertheless, the technique has now become mature and popular 
and will benefit from the huge instrumentation progress achieved in the last years, 
which open new doors for the exploration of dichroims in materials. 
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Chapter 5 A) 
Spintronics and Synchrotron Radiation S 


Richard Mattana, Nicolas Locatelli, and Vincent Cros 


Abstract Having access to the electronic and magnetic properties of spintronic 
systems is of crucial importance in view of their future technological developments. 
Our purpose in this chapter is to elaborate how a variety of synchrotron radiation- 
based measurements provides powerful and often unique techniques to probe them. 
We first introduce general concepts in spintronics and present some of the important 
scientific advances achieved in the last 30 years. Then we will describe some of 
the key investigations using synchrotron radiation concerning voltage control of 
magnetism, spin-charge conversion and current-driven magnetization dynamics. 


5.1 General Introduction to Spintronics: From 
Magnetoresistive Effects to the Physics of 
Spin-Transfer Phenomena 


Whereas electronics relies on the charge of carriers (electrons and holes) to cre- 
ate and convey information, in spintronics the spin of these carriers is also used. 
The generation of spin currents, their detection as well as their manipulation are the 
basic principles of any spintronic devices. Since the discovery of giant magnetoresis- 
tance (GMR) effect in 1988, several breakthroughs have been achieved and strongly 
impacted several domains of applications such as, e.g. magnetic sensors, hard-disk 
drive read heads and more recently nonvolatile memories to cite the most emblematic 
ones. In this first part, we aim at introducing the basic principles of GMR and some 
of the pioneer experiments. Then we present some other important breakthroughs 
such as tunnelling magnetoresistive effects and the manipulation of magnetization 
without the application of a magnetic field but using spin-transfer effects. 
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5.1.1 Giant Magnetoresistance: An Historical Point of View 


5.1.1.1 Electric Conduction in Ferromagnets 


In 3d ferromagnetic (FM) metals, electric conduction comes mainly from 4s band 
electrons, whereas magnetism originates from electrons in 3d bands. Resistivity 
in such materials arises from scattering events of carriers on impurities, phonons 
and local potentials. In particular, localized d orbitals act as diffusive centres for 
the s electrons which carry most of the charge current. Moreover, the non-zero net 
local magnetization comes up with a shift of the d-bands for the two electron spins 
(up and down), leading to different densities of states (DOS) at the Fermi level 
(Fig.5.1 left). At low temperature, the electron spin is observed to be conserved 
during different sources of scattering (electron-phonon, electron-electron, electron— 
magnon, ...), allowing to describe the conduction properties in a ferromagnet using 
a "two-current" model [1]. Spin-up and spin-down electrons carry the current in two 
separated channels (Fig. 5.1 right). The difference in DOS of 3d electrons for spin-up 
and spin-down channels then leads to different number of scattering events for the 
two sub-bands, and thus different resistivities. Conduction in ferromagnet can hence 
be characterized by the spin-asymmetry coefficient o defined by the ratio between 
the resistivity of spin-down channel over the resistivity of spin-up channel. The spin- 
asymmetry coefficient is then strongly correlated to the asymmetry of the DOS at the 
Fermi level. For example, Fe has a spin-asymmetry coefficient lower than 1 whereas 
for Co and Ni this coefficient is larger than 1. 

The experimental demonstration of the two-current model has been provided by 
a series of experiments in which Fert and Campbell showed that the conduction 
of a spin channel can be tuned by introducing different types of magnetic impu- 
rity [2-5]. Depending on the introduction of two different impurities with similar 
or opposite spin asymmetries, the impact on the two spin channels will be differ- 
ent (Fig. 5.2). Let us take a Ni matrix (v > 1) doped with Co impurities (a > 1) 
and Rh impurities (y < 1). With the presence of only Co impurities, scattering in 


Density of states 


Fig. 5.1 (left) Scheme of the density of states in a ferromagnetic metal which has d bands shifted. 
(right) Scheme illustrating the two-current model: spin-up and spin-down electrons carry the current 
in separated channels. As resistivity arises from s-d transition and is proportional to the 3d DOS, 
the resistivity of a ferromagnet can be schematized by two resistances (o4 and p) in parallel 
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Fig. 5.2 Tailoring the channel conduction with impurities. a Resistivity of Ni as a function of Co 
and Rh doping. Co and Rh having opposite spin asymmetries, scattering is significant for both spin- 
up and spin-down channels b and c leading to an increase of resistivity. The resistivity dependence 
as a function of Co and Rh concentration is not linear a. d Resistivity of Ni as a function of Co and 
Au doping. Co and Au having similar spin asymmetries, electrons of only spin-down channel are 
strongly scattered e. The spin-up channel is providing a short-circuit for the electrons f. A small 
linear dependence of the resistivity with Co and Au impurity concentrations is then obtained d. 
Adapted from [5] with permission (Copyright 2008, American Physical Society) 


the spin-down channel is enhanced, but the spin-up channel still experiences low 
resistivity: the resulting global resistivity is low (short-circuit effect). As Rh impuri- 
ties are introduced, scattering becomes significant for both channels [Fig. 5.2(b,c)], 
progressively suppressing the short-circuit effect, and the global resistivity quickly 
increases. The resistivity dependence as a function of Co and Rh concentrations is 
not linear [Fig.5.2a]. Instead, when Rh impurities are replaced by Au ones (o > 1, 
i.e. more electrons of the down channel are scattered, like for Ni and Co), such an 
enhancement of resistivity is not observed. Only electrons of the spin-down channel 
are strongly scattered by Co and Au impurities and a small linear dependence of 
the resistivity with Co and Au impurity concentrations is observed [Fig.5.2d]. The 
spin-up channel is providing a short-circuit for the electrons [Fig.5.2e, f]. 

These experiments indeed brought experimental demonstration of the validity of 
the two-current model and the proof that the mean resistivity can be varied by tailoring 
of the channel conduction through impurities. These pioneer experiments can be 
indeed considered as the pre-concept of the GMR effect in which different resistance 
states will be achievable by applying a magnetic field in a single heterostructure. 

Thanks to the development of molecular beam epitaxy of metallic materials in 
the late 80's, it became hence possible to grow nanometre-thick magnetic metal- 
lic heterostructures (multilayers) involving several separated FM layers. Controlling 
the magnetic configurations of the magnetic films (parallel or antiparallel configura- 
tions of their magnetizations) then grants the possibility to tailor the spin-dependent 
transport of the system. This has led to the discovery of the GMR effect in 1988. 
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5.1.1.2 First Observation of GMR Effects 


The first studies of GMR effects were done in Fe/Cr superlattices (Fe: FM, Cr: non- 
FM metal). A key point of these heterostructures was the control of the magnetic 
configuration. Exploiting the indirect exchange interaction between the magnetic 
layers transmitted by the conduction electron (called RKKY interaction) through 
the non-magnetic spacer layer, P. Grünberg and co-workers proved in 1986 that it is 
possible to align the magnetization of consecutive Fe layers in opposite direction at 
zero magnetic field by choosing adequate Cr interlayer thickness [6]. The parallel 
magnetic configuration can then be reached by applying a large magnetic field in order 
to overcome the RKKY coupling. First magnetoresistance curves have been obtained 
by the groups of A. Fert in Orsay, France (Fig. 5.3 left) and P. Grünberg in Jüllich, 
Germany (Fig. 5.3 right). A higher resistance state is observed at zero magnetic field 
at which the antiparallel magnetic configuration is stabilized. By applying a magnetic 
field, the parallel magnetic configuration is progressively reached inducing a large 
decrease of the resistance. In the experiments done in Grünberg's group, a GMR 
effect of 1.596 at low temperature was obtained in the case of a Fe/Cr/Fe trilayer 
(Fig. 5.3 right). The striking point is that this effect is about one order of magnitude 
larger in amplitude than the anisotropic magnetoresistance of a single Fe thin film 
(also shown on Fig. 5.3 right). In Fert's group, measurements were performed in 
Fe/Cr superlattices instead of trilayers, which allowed to considerably increase the 
amplitude of the effect that reached 80% for sixty Fe/Cr bilayers (Fig. 5.3 left). 

An important development towards the implementation of the GMR systems for 
applications in read heads for hard-disk drives or sensors was made in 1991 by 
B. Dieny et al. [7] at IBM who reported the first observation of GMR in simple 
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Fig. 5.3 Pioneer observation of GMR in Fe/Cr superlattices (left) and in Fe/Cr/Fe trilayer (right). 
At zero magnetic field, magnetization of consecutive layers is pointing in opposite direction leading 
to a high resistance state. By applying a magnetic field, a parallel magnetic configuration is reached 
and so a low resistance state is obtained. For a Fe/Cr/Fe trilayer, the GMR ratio is about 1.5% and 
one order of magnitude larger than the anisotropic magnetoresistance of a single Fe layer (right). 
By using Fe/Cr superlattices (60 Fe/Cr repetitions) the GMR ratio reaches 80% at low temperature. 
Reproduced from [9] (left) and [10] (right) with permission (Copyright 1988 and 1989, American 
Physical Society) 
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trilayered structures with two distinct coercive fields, later called spin valves. In 
these structures, one of the layers switches using low fields while the second is stable 
up to large fields. It is to be emphasized that the time between the first discovery 
in a lab and the use of the GMR effect in cutting-edge technology devices has been 
extremely fast as less than 10 years after the discovery of GMR, and IBM introduced 
in 1997 spin valves in read heads of hard-disk drives. GMR-based magnetic sensors 
is now used in a multitude of applications such as monitoring wheel speed, detecting 
charge current, and fluid flow. 

The research on magnetic multilayers and GMR became rapidly a very hot topic. 
It is not our purpose here to make here a review of all experimental and theoretical 
results that followed up the pioneer results. A complete review can be consulted 
in [8]. 


5.1.1.3 A Simple Model to Describe the GMR 


The two-current model, described previously, was developed to explain the spin- 
dependent resistivity in magnetic materials. It can, in a rather simple way, be adapted 
to describe the giant magnetoresistive effect in these magnetic multilayers. This 
model is based on two assumptions: 1) o > 1: the minority-spin electrons (opposed 
to local magnetization) are more scattered than those of majority spin (aligned with 
local magnetization); 2) the spin is conserved when the electrons are scattered. These 
two conditions are fulfilled at low temperature. 

Two geometries can be considered to evaluate the resistance of such a structure: 
either the current flows in the direction of the layer planes (known as "current-in- 
plane GMR", CIP-GMR), or the current flows in a direction perpendicular to the 
layer plane (known as "current-perpendicular-to-the-plane GMR", CPP-GMR). A 
similar description can be used to account for the magnetoresistive properties for 
both geometries, without entering into the fine discussion about the actual physical 
spin-transport mechanisms, as long as the layer thickness remains small compared 
to a characteristic length associated with each geometry: the mean free path for the 
CIP case and the spin-flip length for the CCP case [11]. 

Let us note r the resistance of a FM layer for the majority-spin channel and R the 
resistance for the minority-spin channel, with r < R. Here the resistance of the NM 
separating layer is neglected. As illustrated in Fig. 5.4, an electron will have to pass 
through adjacent FM layers. Depending on whether these layers have parallel (P) or 
anti-parallel (AP) magnetizations, the resulting resistance shall differ. 

In the (P) configuration, the electrons with spin (+) and (||) are either the electron 
spin majority or minority in all magnetic layers. Spin (f) electrons then experi- 
ence identical resistances r} = r when crossing each magnetic layer, while spin (|) 
electrons experience identical resistances r} = R. The mean resistance then writes 
rp = A In case of materials with large spin asymmetry (o > | andr < R), the 
multilayer is short-biased by the spin (f) channel and the total resistance is rp © r. 

For the (AP) configuration, the electrons of the two (f) and (J) channels cor- 
respond alternately to electron spin majority and minority in consecutive magnetic 
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Parallel configuration (P) Anti-parallel configuration (AP) 


Fig. 5.4 Illustration of the two-current model: conduction of an electron in a ferromagnetic 
metal/normal metal/ferromagnetic metal (FM/NM/FM) multilayer. The electrons with a spin aligned 
parallel (resp. anti-parallel) to the local magnetization see a resistance r (resp. R) through this 
magnetic layer. The equivalent resistance circuit is presented for two configurations: consecutive 
ferromagnetic layers with parallel magnetization or anti-parallel magnetization 


layers, and thus the effect of short-circuit by one of the channels is suppressed. Con- 


sequently the two channels have the same resistance Rtr and the total resistance is 
Rtr , Which is in general much larger than rp =r. 


This brings the possibility of switching between high and low resistance states by 
simply changing the relative orientation of the magnetization of consecutive layers. 
If we can engineer these structures to stabilize the two (P) and (AP) magnetic states 
in absence of any external field, then such system defines a magnetic bit to store 
the information, using the powerful stability of the magnets, already well known 
through the development of hard-disk drives. The state can then be read through a 
very simple process, i.e. by measuring the resistance of the stack. 

Following this model, the amplitude of the GMR ratio can be simply deduced 


Rap—R R—-ry 
Gui TRE 2 =) (5.1) 
Rp ARr 


In short, the understanding and expertise acquired during the 60s and 70s for 
tailoring the spin-transport properties combined with the development of metallic 
magnetic multilayers in the 80s has led to the discovery of the GMR effect in 1988. 
Less than 10 years after the discovery of the GMR effect considered as the birth 
of spintronics, this effect has largely participated to the explosion of the amount of 
data storage in the mid-90s through the commercialization of the first generation of 
spintronic devices such as magnetic sensors used as read heads in hard-disk drives. 
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5.1.2 Tunnelling Magnetoresistance 


The pioneer observation of tunnelling magnetoresistance (TMR) in magnetic tunnel 
junctions (MTJs) at room temperature in 1995 is considered to be the second break- 
through in spintronics, leading to a second generation of spintronic applications such 
as magnetic random access memory (MRAM). In this section, we describe the first 
TMR measurements, then the two standard models describing TMR effects. 


5.1.2.1 First Experimental and Theoretical Studies 


A magnetic tunnel junction is a device in which two FM electrodes are separated 
by an ultra-thin insulating barrier. The first MTJ exhibiting TMR has indeed been 
reported in 1975 by M. Julliére [12] who measured a TMR ratio of 14% at 4.2K in 
a Fe/Ge/Co junction. Twenty years later TMR effects have been observed at room 
temperature by J. S. Moodera [13] and T. Miyazaki [14] using amorphous alumina as 
tunnel barrier. In Fig. 5.5, the TMR curve obtained for a CoFe/Al,03/Co MTJ [13] 
is displayed. The TMR ratio is around 12%, at room temperature. This increase in 
the resistance difference between the two states allows an easier detection of them 
and let appear the potential of such spintronic devices for memory applications. 

Such TMR effect has been first explained by M. Julliére in 1975. Keeping the 
free-electron approximation, he proposed an additional assumption: the electron spin 
conservation during the tunnelling process, meaning that electrons can tunnel from 
one FM electrode toward the second FM electrode only into empty states having 
identical spins. This simple model easily allows to understand that the tunnelling 
current in the (P) and (AP) alignments of magnetizations will differ. 

As illustrated in Fig. 5.5, for the parallel (P) configuration of the magnetizations: 
when a small voltage is applied, majority-spin (denoted as 7) electrons from the 
injector will tunnel toward majority-spin (+) empty states of the collector, allowing 
"high" current. Atthe same time, minority-spin electrons from the injector will tunnel 
toward minority-spin (|) empty states of the collector, allowing "low" current.! The 
conductance then writes 


Gp « DÌ (Eg) D (Ep) + DI (Ep) D} (Er) - (5.2) 


In the case of antiparallel (AP) configuration of the magnetizations, majority-spin 
(t) electrons from the injector will tunnel toward minority-spin (|) empty states of the 
collector. Minority-spin (|) electrons from the injector will tunnel toward majority- 
spin (7) empty states of the collector. Both currents are then low. The conductance 
then writes 


! Note here that the arrow does not define any direction of the spin, but rather the fact that it is 
aligned with the local magnetization. In the (AP) case, a spin with one direction will correspond to 
(1) spin in one electrode and (|) spin in the other. 
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Fig. 5.5 TMR curve of a CoFe/Al;O3/Co MTJ recorded at room temperature [Adapted from [13] 
with permission (Copyright 1995, American Physical Society)]. Schematic of the spin-dependent 
tunnelling process through an insulating barrier when their magnetizations are aligned parallel (left) 
and antiparallel (right) to one another. The process is assumed to be purely elastic, so that no mixing 
of spin states occurs during the tunnelling process 


Gap x DÌ (EF) D} (Er) + DI (Ep) D} (Ep) . (5.3) 


Using these results for the conductance, the TMR ratio, characterizing the resis- 
tance difference in (P) and (AP) configurations, is expressed as 


Gp—Gap  Rap-Rp | 2B,P, 


TMR= = = : (5.4) 
Gap Rp 1 — PP 
where P; defines the spin polarization for each electrode as 
D] (Er) — D} (E 
p. P (Er) ; (Er) (5.5) 


' Di (Ep) + D} (Er) ` 


A key issue to get an accurate prediction of TMR is to properly estimate the actual 
amplitude and sign of the spin polarization for a given FM material. Obviously, the 
largest the spin polarization is, the highest will be the TMR amplitude. It thus explains 
the strong research activity in the last decade on material science to investigate novel 
families of materials (magnetic oxides, Heusler alloys, etc.) for which some of their 
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compounds are predicted to be half metallic, i.e. to have a 100% spin polarization at 
the Fermi level. 

Julliére's model has enjoyed much success in correctly predicting TMR ampli- 
tudes in most MTJs with amorphous barriers, until the emergence of MTJs based on 
crystalline insulating barriers, notably magnesium oxide (MgO) barriers in the begin- 
ning of the 2000s [15, 16]. The model described previously is actually oversimplified 
in that it considers an identical tunnelling process of every electron, independently 
on their band belonging, hence forgetting about the interplay of the band structures 
throughout the MTJ heterostructure. 


5.1.2.2 Coherent Tunnelling in Epitaxial Magnetic Tunnel Junctions 


Another major breakthrough in the field of spintronics corresponds to the introduc- 
tion of crystalline MgO tunnel barriers, that is today's standard in MTJ implemented 
in MRAMs, read heads and field sensors working with TMR effect. These exper- 
imental developments have been stimulated by some theoretical calculations made 
by W. Butler [17] predicting that huge TMR ratios as large as 1600% are antici- 
pated for epitaxially grown Co or Fe electrodes on crystalline, instead of amorphous 
MgO. These calculations, developed with ab initio methods, derive the tunnelling 
probability of each kind of electrons, depending on their orbital symmetry. 

Contrary to amorphous alumina [Fig. 5.6a left], in crystalline MgO tunnel junc- 
tions, the electron wave functions in the FM material are coupled with evanescent 
wave functions having the same symmetry in the barrier [Fig. 5.6a right]. Through ab 
initio calculations, it was then predicted that the tunnelling probability of an electron 
strongly depends on the orbital symmetry of the electron (of the band it belongs 
to). Beyond the two-current (up-spin and down-spin) model, the system behaves as 
if it exists an independent current channel for each band and each spin, leading to 
a possible effective symmetry filtering of the tunnelling current. The tunnel barrier 
can, therefore, filter the wave functions and thus select the spins in the electronic 
transport. 

This mechanism of orbital selection for the tunnel conductance is presented in 
Fig. 5.6. In Fig. 5.6b, band dispersion of bcc Fe(001) for the minority and majority 
spins is shown. ^, Bloch states are present at the Fermi level only for the majority 
spins. In Fe(100)/MgO/Fe(100) systems, band structure calculations have demon- 
strated that majority-spin electrons [see Fig.5.6c] are mainly filling A; symmetry 
states (hybridized states with spd characters), whereas minority-spin electrons [see 
Fig.5.6d] are filling ^» symmetry states (d type states) and A5 symmetry states 
(hybridized pd states). Moreover, this is crucial for getting a large TMR ratio, the 
tunnelling exponential decay is much stronger for A» and As states compared to 
A, states. For dygo = 8 monolayers, which is a typical barrier thickness that can 
be achieved experimentally, the probability of transmission of A, electrons is larger 
than for As electrons by 10 orders of magnitude. Ultimately, only A, electrons con- 
tribute significantly to the current. It is this filtering effect which can explain the large 
values of TMR expected on epitaxial or highly textured structures and that has made 
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Fig. 5.6 a Schemes illustrating electron tunnelling through an amorphous AI-O barrier (left) and 
through a crystalline MgO barrier (right). b Band dispersion of bcc Fe(001) for the minority and 
majority spins. ^1 Bloch states are present at the Fermi level only for the majority spins. Tunnelling 
DOS for Fe/MgO/Fe at k = 0 for majority ¢ and minority d spins. Decay of A; Bloch states in 
MgO is less attenuated than the A; Bloch states. e TMR of 600% obtained at room temperature for 
a CoFeB/MgO/CoFeB MTJ. Adapted from [18] (a, b) with permission (Copyright 2007, Institute 
of Physics), from [17] (c, d) with permission (Copyright 2001, American Physical Society) and 
from [19] (e) with permission (Copyright 2008, American Institute of Physics Publishing) 


the MgO-based tunnel junctions at the core of the development of new spintronic 
devices like the MRAMs. 

After these theoretical predictions, a strong research effort has been made to 
obtain epitaxial growth of structures for Fe/MgO/Fe or CoFeB/MgO/CoFeB [20, 
21]. These efforts have resulted in a TMR of about 600% obtained in 2009 [19] in 
CoFeB/MgO/CoFeB MTJs at room temperature [see Fig.5.6e]. An excellent review 
on the physics of tunnelling transport in MgO-based systems and the experimental 
state of the art for TMR has been written by S. Yuasa [18]. 

Since the measurements of TMR effects at room temperature in 1995, a lot of 
work has been done in order to both increase the TMR ratio and reduce the MTJ 
resistance. This has been achieved by developing high-quality crystalline MgO-based 
MTJs. This effort has led to the development of a new class of magnetic memories 
called MRAMs. 
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5.1.3 Magnetization Manipulation without Magnetic Fields 


For practical applications, e.g. MRAMs, the possibility to manipulate the magne- 
tization of FM electrodes in spin valves or MTJs without using magnetic field is 
required. This corresponds to the next breakthrough with the prediction and the 
observation of spin-transfer effects, first providing a new way to reverse the magne- 
tization in a nanostructure but also to generate in some cases steady precession of the 
magnetization. In this section, we briefly present the physics of spin-transfer torque 
(STT) and also discuss an alternative approach consisting in the manipulation of the 
magnetization through the application of a voltage. 


5.1.3.1 Spin-Transfer Torque 


In the first generation of MRAM devices developed in 2004, the magnetization rever- 
sal between the two possible magnetic states was realized by using local magnetic 
fields generated by electrical currents flowing in lines close to each magnetic ele- 
ment. This writing process rapidly suffered from both the large energy consumption 
needed to generate large enough magnetic fields and from cross-talk problems due to 
the difficulty to write a single bit. A solution came out from a major breakthrough in 
spintronics in 1996 when it has been proposed that a spin-polarized current flowing 
through magnetic multilayers provides a new way to manipulate the magnetization 
of a ferromagnet. This new effect, which was called spin-transfer effect, has become 
rapidly a very hot topic. Moreover, it is at the basis of a new generation of magnetic 
memories, called spin-transfer torque-MRAMs (STT-MRAMs). 

The concept of spin-transfer effect was proposed in 1996, concomitantly by 
J. Slonczewski [22] and L. Berger [23]. To describe this effect, one can take a stan- 
dard spintronic structure composed of a fixed FM electrode F; and a free FM layer Fz 
separated by a NM spacer. When the electrons flow from layer F; to Fy, the current 
becomes spin-polarized after passing through F,. This non-zero spin polarization is 
aligned along the magnetization direction in F; and propagates into the NM metallic 
spacer or tunnels in the case of an insulating barrier, so that it arrives at the NM/F 
interface. If the magnetization M, and M, are non-collinear, it results that a compo- 
nent of the spin current transverse to F, exists at the NM/F» interface [green arrows in 
Fig.5.7a]. When the electrons penetrate into F2, the spin of the conduction electron 
becomes aligned over a very short distance (within a few atomic distances) toward 
the magnetization direction in Fy because of a strong exchange interaction between 
conducting (s) electrons and localized (d) electrons, the latter being responsible of 
the magnetic moments. 

During all this process, the total spin angular momentum is conserved. Thus the 
transverse component of the spin current Am lost by the electrons when passing 
through F, is indeed absorbed and transferred to the local magnetization of Fo. 
This transfer of spin angular momentum results in a torque exerted by the spin- 
polarized current on the local magnetization. For this current polarity, when the 
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Fig.5.7 aSchemeofa magnetic trilayer structure for illustrating the concept of spin-transfer torque. 
b The transverse component of the angular moment An of the spin-polarized current is transferred 


to the magnetization. It results in a torque exerted on the magnetization M2, of the thin magnetic 
— 


layer, which aligns along magnetization M for positive current. c Schematic representation of the 
torques acting on the magnetization including spin-transfer effect 


— 
current increases enough, it tends to align the magnetization M2 along the direction 


of the spin polarization of the current, i.e. along the magnetization M, [Fig.5.7b]. As 
all the process occurs in the first atomic planes after the interface, the spin-transfer 
mechanism is an interfacial effect. 

After having introduced the spin-dependent transport mechanisms at the origin 
of STT, our aim is now to address the influence of this torque on the magnetization 
dynamics of the layer F2. A classical approach to describe the dynamical motion of a 
magnetization is a differential equation, named the Landau—Lifschitz—Gilbert equa- 
tion, to which we add Slonczewski's component of spin torque.? This equation has the 


?For simplicity we have only introduced the main STT called "in-plane torque". A second spin 
torque, called "field-like torque" or "out-of-plane torque", is similar to a torque exerted by a field 


along the magnetization of F> and its action might, therefore, be included into H err. 
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following form, taking magnetizations normalized to the saturation magnetization, 
mj=1,2 for the two FM layers 


dm? (m x B )+ a ee Jh gup [ms x (m5 x y 
— -—— ; m i m m 
di youn, ef f mMg 2 dt spin, 5 Dm 2 2 1 
(5.6) 


The first term corresponds to the tangential force describing the magnetization 


precession around effective field H, ff [green arrow in Fig.5.7c], which takes into 
account the external applied magnetic field, magnetic anisotropy fields, coupling 
fields, etc. The second term gives a phenomenological description of magnetization 
dissipation of the system. The coefficient v, named Gilbert damping, (about 107? 
for standard FM materials), describes the damping rate of the motion of im towards 


the equilibrium position oriented along H ,;; [blue arrow in Fig.5.7c]. The mag- 
netization relaxation is induced by a damping force tangential to the magnetization 
trajectory. The third term is the so-called Slonczewski torque (or in-plane torque) 
where upg is the Bohr magneton, f the layer thickness, J the injected current density, 
and P;,;, the amplitude of spin polarization at the interface NM/F» and Msz the mag- 
netization of the ferromagnet F2. This simplified description allows making clear the 
nature of the main contribution of the spin-transfer force that can be described as 
a non-conservative force acting in the same direction than the natural damping, i.e. 
perpendicularly to the magnetization trajectory. Depending on the sign of the injected 
current, the spin-transfer torque decreases or increases the effective damping [purple 
and red arrows in Fig. 5.7c]; it favours the stability of the parallel or the antiparallel 
magnetic configuration. 

For large current density (7107 A cm"? for a typical material system), the STT 
fully compensates the damping torque, steady magnetization precession occurring at 
the ferromagnetic frequency (typical in the GHz range for the ferromagnetic materials 
andinthe THz range for antiferromagnetic ones) can be established. The spin transfer 
induced magnetization dynamics can convert into oscillations of resistance through 
the magnetoresistive effect described previously, and in turn into a radiofrequency 
voltage signal. Spin-torque effect thus makes it possible to convert a dc current into 
a rf voltage and so to build microwave oscillators at the nanoscale. 

For larger current density, the torque can become sufficient to commute the 
magnetization between the two stable configurations. A negative current will, for 
instance, destabilize the parallel magnetization configuration, while stabilizing the 
anti-parallel configuration, allowing to commute from P to AP configuration. Iden- 
tically, a positive current allows to commute from AP to P configuration. 

First experimental results allowing to confirm the theoretical predictions of the 
existence of STTs have been obtained by M. Tsoi et al., using point contact geome- 
try for injection of a large current into a magnetic layer [24]. Then after, it has been 
demonstrated that magnetization commutation can be achieved back and forth by the 
STT effect in Co/Cu/Co spin valves [25]. Figure 5.8a represents magnetization rever- 
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Fig. 5.8 a Current-induced magnetization switching in a Co/Cu/Co nanopillar spin valve. Jumps 
in the curves correspond to the magnetization reversal of the ferromagnetic free layer. b Current- 
induced magnetization switching in a CoFeB2.5/AIOx/CoFeB? 5 MTJ. c Domain wall motion 
induced by a current in Co/Cu/Py spin valves. State 1 and 2 correspond to different positions of 
the domain walls in the strip. By applying a large enough current it is possible to move the domain 
walls and thus reach the parallel (P) or antiparallel (AP) magnetic configuration. Reproduced from 
[25] (a) with permission (Copyright 2000, American Physical Society), from [26] (b) and [27] (c) 
with permission (Copyright 2005 and 2002, American Institute of Physics Publishing) 


sal induced by a current for two different applied magnetic fields. Current density 
in the 107 A cm~? range corresponding to few mA for their nanopillars is needed to 
switch the Co free electrode magnetization. Few years later similar results have been 
obtained in MTJs using AlOx and MgO tunnel barriers [26]. Figure 5.8b illustrates 
the magnetization reversal by the current for a CoOFeB/AIOx/CoFeB magnetic tunnel 
junction. A smaller current (less than 1 mA) is needed to commute the magnetization 
and thus to switch between the parallel and antiparallel magnetic configurations. 
This result shows that STT effects can be used not only in metallic spin valves but 
also in MTJs and has opened the door for the development of new STT-MRAM tech- 
nologies. Finally, STT has been also used to induce magnetic domain wall motion. 
Figure 5.8c represents the first observation of domain wall motion by spin transfer 
in a Co/Cu/Py trilayer, where Py stands for permalloy (NigoFe»o) [27]. Starting from 
configuration 1 or 2 (the domain wall is located at the two-third of the strip) it is 
possible to move the domain wall by a current to reach the parallel (P) or antiparallel 
(AP) magnetic configuration. The current density needed to move these domain walls 
is about 10’ A cm^?. Note that for all these pioneer experiments, a magnetic field 
is applied. Since then, a lot of work has been done and now magnetization manipu- 
lation without applying magnetic field is feasible. Later, STT has been also used to 
manipulate other magnetic textures such as magnetic vortices [28] or skyrmions, or 
generate dynamics [29, 30]. To conclude, STT is now used to write the information 
of a single bit in magnetic memories.? 


For more information on the latest MRAM development, see https://www.mram-info.com/ 
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Fig. 5.9 (left panel) Voltage can be used to modify magnetic anisotropy, coercive fields, magne- 
tization values, exchange bias, Curie temperature or magnetoresistance. This can be achieved by 
different mechanisms such as charge, strain exchange coupling, orbital ordering or electromigration 
(right panel). Reproduced from [31] with permission (Copyright 2017, Elsevier) 


5.1.3.2 Voltage Control of Magnetism 


Although STT is very efficient to manipulate magnetization, alternative or comple- 
mentary approaches have been explored such as the voltage control of magnetism. 
Magnetic anisotropy, coercive fields, magnetization magnitude, exchange bias or 
Curie temperature can be tuned by a voltage [Fig.5.9 (left panel)]. Different mech- 
anisms such as charge accumulation/depletion, electromigration, strain or orbital 
ordering are involved [Fig.5.9 (right panel)]. 

The samples to study the effect of an electric field on the magnetization are gener- 
ally composed of an ultra-thin FM film in contact with either dielectric, ferroelectric 
or piezoelectric materials. A wide variety of FM materials (metals, oxides or diluted 
magnetic semiconductors) have been also studied [31]. Depending on the mechanism 
used to control magnetism through an applied voltage, the ferromagnet thickness has 
to be adjusted. As charge, orbital ordering or electrochemistry are mainly interfa- 
cial effects, the thickness of the ferromagnet should be close to the screening length 
which is in the angstrom range for metal and in nanometre range for dilute magnetic 
semiconductors. Thicker ferromagnets, up to the micrometre range, can be used if 
strain is involved. In the following, several examples illustrating the variety of devices 
studied are displayed. Readers are invited to refer to the article of C. Song et al. [31] 
for a complete review. 

In Fig. 5.10, we show different examples where the Curie temperature, the coercive 
field and the magnetic anisotropy are modified by an electric field. This magnetiza- 
tion control involves different mechanisms such as charge or strain effects. One of 
the first demonstration of magnetization manipulation by an electric field has been 
done using a dilute magnetic semiconductor [32]. In Fig. 5.10a, b, Hall effect curves 
recorded at different gate voltages for a (In, Mn)As-based field-effect transistor are 
presented. Magnetic properties and notably the Curie temperature of the diluted 
magnetic semiconductor (In, Mn)As being dependent on the hole density, it is thus 
possible to tune the Curie temperature by an electric field modulating the carrier den- 
sity [33]. Setting the sample temperature close to its Curie temperature, the switching 
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Fig. 5.10 Examples of voltage control of the magnetization. a, b Hall effect curves recorded for 
different gate voltages in a dilute magnetic semiconductor (In, Mn) As field-effect transistor. The 
gate voltage induces a modulation of the hole concentration in (In, Mn)As and thus a modulation of 
the Curie temperature. c, d Variation of the coercive field of a FePt thin film as a function of bias volt- 
age. The bias voltage modifies the number of 3d electrons and thus changes the magnetocrystalline 
anisotropy. e, f Modification of the magnetic anisotropy induced by strain ina FeGaB/PZN-PT multi- 


ferroic heterostructure. The voltage leads to lattice modulation of the PZN-PT ferroelectric inducing 


a strain modulation of the magnetostrictive FeGaB layer. Under an electric field of 8kV cm™! a 


magnetic field of 70mT is needed to align the magnetization along the [100] direction. Adapted 
from [32] (a, b) with permission (Copyright 2000, Nature Publishing Group), from [34] (c, d) with 
permission (Copyright 2007, American Association for the Advancement of Science), from [31] 
(e) with permission (Copyright 2017, Elsevier) and from [35] (f) with permission (Copyright 2009, 
John Wiley and Sons) 


between ferromagnetic and paramagnetic states can be realized by applying, respec- 
tively, a negative and a positive gate voltage (125 V). A similar approach has been 
investigated to tune the coercive field of the FePt FM intermetallic compound [34]. 
Using an electrolyte to modify electron density at the FePt interface, a modification of 
4.5% of the coercive field is observed [Fig.5.10c, d]. The variation in the number of 
3d electrons directly affects the magneto-crystalline anisotropy and thus the coercive 
field. Interestingly, this device geometry allows tuning the magnetic properties of a 
ferromagnet by applying a low bias voltage (below 1 V). Although the modification 
of the coercitive film is quite small, this first demonstration of voltage control of a FM 
metal was quite encouraging as it opens the door to room temperature modulation 
of magnetism by applying small voltage. Another approach to modify the magnetic 
anisotropy is to apply a strain. By combining ferroelectric and magnetostrictive mate- 
rials, it is possible to modify the magnetic properties by a voltage. Under voltage, the 
lattice of the ferroelectric layer is modulated through the inverse piezoelectric effect 
and will thus induce a strain modulation in the ferromagnet. In Fig.5.10(e,f), we 
show an example where the in-plane magnetic anisotropy is modified by an electric 
field. Whereas at zero electric field, the remanent magnetization is along the [100] 
direction, an electric field of 8 kV cm~! allows rotating the magnetization and so a 
magnetic field of 70 mT is now needed to saturate it along the [100] direction. 
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The voltage control of magnetization is obviously less mature than STT or spin- 
orbit torque (SOT) technologies. However, the large variety of ferromagnets (metals, 
semiconductors, oxides) and gating materials (dielectric, ferroelectric, electrolyte) 
that can be used, makes this field fascinating and promising to reduce power con- 
sumption in memory technologies, which remains still a crucial issue. 


5.1.4. Summary 


We have introduced some breakthroughs in the field of spintronics achieved dur- 
ing the last 30 years. Giant magnetoresistance has emerged rapidly as a promising 
effect to build efficient magnetic sensors working at room temperature. The ability 
to manipulate magnetization by STT in MTJs has allowed developing new magnetic 
memories such as MRAMSs that are now commercially available. This is of course not 
exhaustive and there are a lot of other exciting and promising research fields such as 
spin-orbitronics, magnonics, molecular spintronics or antiferromagnetic spintronics, 
to cite only few of them. Spin-orbitronics is certainly the most active field nowadays. 
Emblematic topics are conversion between charge and spin currents, spin-polarized 
surface and interface states or novel chiral magnetic textures (skyrmions and domain 
walls). See for example A. Soumyanarayanan et al. for a review [36]. 

In all the spintronic effects that we have just introduced, like in many other fields, 
interfaces play a key role, and therefore a deep knowledge of the electronic and 
magnetic properties of these interfaces is desired. Synchrotron radiation-based mea- 
surements such as absorption and photoelectron spectroscopies and microscopies are 
powerful techniques to probe these interfaces. 


5.2 Examples of Synchrotron Radiation Contribution to 
Spintronics 


In the following, several examples for which synchrotron radiation-based measure- 
ments have allowed a better understanding of spintronic devices are presented. We 
have selected three spintronics topics: (1) voltage control of magnetism; (ii) spintron- 
ics with pure spin currents; (iii) current-driven magnetization dynamics. 
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5.2.1 Voltage Control of Magnetism 


5.2.1.4 Effect of Charge Accumulation/Depletion and Electromigration 


Perpendicular magnetic anisotropy (PMA) has been widely investigated in 
NM/FM/oxide heterostructures. For example, by varying the oxidation time of top 
Al layer in Pt/Co/AlOx devices, it has been demonstrated that the PMA is induced 
by Co-O bonds [37]. Figure 5.11(a,b) represent extraordinary Hall effect curves 
and X-ray absorption spectroscopy (XAS) measurements at the Co L23 edges for 
Pt/Co/AIOx samples with different oxidation times. The extraordinary Hall effect 
allows detecting the magnetic easy axis and XAS measurements probe the elec- 
tronic properties of the cobalt layer. A metallic Co/AI interface resulting from Al 
under-oxidation, or the formation of a CoO layer due to over-oxidation, both pro- 
duce magnetization aligned within the plane of the film. Hence optimized oxidation, 
leading to Co-O-AI bonds, induces a perpendicular magnetic anisotropy. Instead of 
playing with oxidation time, D. Chiba et al. [38] have shown that it was possible to 
switch the easy magnetic axis from in-plane to out-of-plane by applying a bias volt- 
age. In Fig. 5.11c, we show that a bias voltage of 10 V allows to modify the magnetic 
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Fig. 5.11 a Extraordinary Hall effect as a function of oxidation time measured in a Pt/Co/AlOx 
trilayer. b XAS spectra recorded at the Co L2,3 edges for different oxidation times. These measure- 
ments show that optimized Al oxidation induces perpendicular magnetic anisotropy due to Co-O 
bonds. c Switching between in-plane magnetization to out-of-plane magnetization by applying a 
bias voltage for a Pt/Co/MgO sample. Reproduced from [37] (a, b) with permission (Copyright 
2008, American Institute of Physics Publishing), and from [38] (c) with permission (Copyright 
2011, Nature Publishing Group) 
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anisotropy in Pt/Co/MgO [38]. This magnetic anisotropy modification with voltage 
can originate from charge accumulation or oxygen electromigration. 

C. Bi et al. have performed Hall resistivity and XAS measurements at the Co 
L3 edge on a Pt/Co/GdOx sample [39]. A clear correlation between the evolution 
of the magnetic anisotropy and the Co oxidation was demonstrated [Fig. 5.12(a,b)]. 
In Fig. 5.12b, one can see that the shape of the Co XAS spectra are modified with 
voltage. Whereas a negative voltage induces some fine structures in the XAS spectra 
indicating a Co oxidation, a positive voltage reduces the Co layer, and therefore Co 
spectra become similar to that of metallic Co. Thus the Co magnetic anisotropy in 
this Pt/Co/GdOx structures can be reversibly controlled by voltage via Co oxidation 
and reduction. This result confirms the earlier experiments performed by F. Bonell 
et al. [40] on Au/CoFe/MgO samples. Hence, in these examples, the modification of 
magnetic anisotropy is rather due to oxygen electromigration than charge accumu- 
lation. 

In V/Fe/MgO devices, S. Miwa et al. [41] have indeed observed a different 
behaviour. A slight change of coercive field with bias voltage has been measured 
[see Fig.5.12c] but without any modification of the Fe L5? 5 XAS and X-ray mag- 
netic circular dichroism (XMCD) spectra with the applied voltage, suggesting that 
Fe is not oxidized [Fig. 5.12d]. This is quite surprising since the electric field applied 
is similar in Pt/Co/GdOx and V/Fe/MgO experiments. This result shows that modi- 
fications of the coercive field is not induced by electromigration but is rather due to 
charge accumulation. 

To conclude, this careful investigation of the electronic properties of the FM/oxide 
interface has allowed to unveil the origin of voltage control of magnetic anisotropy 
in each NM/FM/oxide structures and thus to discriminate between electromigration 
and charge effects. 


5.2.1.2. Effect of Strain 


Strain has been also used to tune magnetic anisotropy and can be very efficient if 
magnetostrictive materials are used. By combining piezoelectric and magnetostric- 
tive materials, it is possible to control the magnetic anisotropy by voltage. Here we 
show an example where photoelectron emission microscopy (PEEM) measurements 
have been performed to highlight the correlation between strain and magnetism [42]. 
A magnetostrictive CoFe Oy, layer has been deposited on a piezoelectric BaTiO} sub- 
strate. BaTiO; can have domains with different unit cell parameters, and therefore 
can induce different domains with different strains in the CoFe20; layer. X-ray lin- 
ear dichroism (XLD) being sensitive to local coordination, different XLD spectra 
will be obtained. In Fig.5.13d, XLD-PEEM'" image recorded at the Fe L5 edge are 


4X AS spectra can be also recorded on the different domains allowing to measure the local electronic 
properties. 
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Fig. 5.12 a Hall resistance for a Pt/Co/GdOx sample as deposited (red) and after an applied bias 
voltage leading to an electric field of —625 kV cm! (blue) and 4-625 kV cm! (purple) showing an 
in-plane to out-of-plane magnetization transition. b XAS and XMCD spectra recorded at the Co L3 
edge. A negative voltage induces an oxidation of the Co layer, whereas a positive voltage reduces 
the Co layer. e Magnetic hysteresis loop of a V/Fe/MgO sample obtained for two bias voltages 
showing a modification of the coercive field. d Fe L2,3 XAS and XMCD spectra recorded at +4 V 
and —4 V. Fe oxidation/reduction is not observed demonstrating that the coercive field modification 
is rather attributed to a charge effect. a, b Adapted from [39] with permission (Copyright 2014, 
American Physical Society) and c, d from [41] with permission (Copyright 2015, American Institute 
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presented showing different strain domains for the CoFe?O, layer induced by the 
BaTiO; substrate. By recording XMCD-PEEM images at the Fe L4 edge, domains 
with different magnetic signals are observed. In Fig. 5.13, it appears clearly that a 
correlation exists between the strain state and the magnetic signal. Black domains 
in the XLD-PEEM [see Fig.5.13d] correspond to a magnetization along the [100] 
direction [see Fig.5.13a], whereas for white domains, the magnetization is along 
the [010] direction [see Fig. 5.13c]. Hence, by performing XLD-PEEM and XMCD- 
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Fig. 5.13 XMCD-PEEM 
(a-c) and XLD-PEEM (d) 
images recorded at Fe L5 
edge for a BaTi03/CoFe204 
sample. XMCD-PEEM 
images show magnetic 
stripes along the [100] and 
[010] directions. These 
magnetic stripes are 
correlated to domains 
Observed in the XLD-PEEM 
image d which indicates 
different strain states of 
CoFe204 induced by the 
BaTiO} substrate. 
Reproduced from [42] with 
permission (Copyright 2012, 
American Physical Society) 
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PEEM measurements, it is possible to locally observe the influence of strain on 
magnetism. 

We have shown that X AS has allowed determining the origin of the voltage control 
of magnetism in NM/FM/oxide structures. In particular, it is possible to disentangle 
charge and electromigration effects. By performing PEEM measurements with both 
circular and linear polarized X-rays, the correlation between strain and magnetism 
can be directly probed. Hence, synchrotron radiation-based measurements are useful 
to better understand the mechanisms of the voltage control of magnetism. 


5.2.2 Spintronics with Pure Spin Current 


The generation of pure spin current from heat, charge current, light or vibration is an 
active and promising research field [43]. A wide variety of materials (ferromagnets, 
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heavy metals, semiconductors, insulators, topological insulators, etc.) and devices 
are currently investigated. In this paragraph, we aim at discussing the spin-charge 
conversion using topological insulators as well as the heat-spin current conversion 
in ferromagnetic insulator/paramagnetic metal devices. 


5.2.2.1 Spin-Charge Conversion 


Conversion between charge and spin currents has been a very active branch of spin- 
tronics in the last couple of years. Such a conversion can be achieved in bulk materials 
(the so-called spin Hall effect) or at interfaces (Edelstein-Rashba effect) relying on 
the spin-orbit interaction and/or extrinsic effects. The studied materials are usually 
NM metals with a large spin-orbit coupling. A second approach is to rely on spin— 
orbit properties at the interfaces either at Rashba interfaces or through the surface 
states of topological insulators (TIs). A complete review of this approach can be 
found in J. Sinova et al. [44] or A. Soumyanarayanan et al. [36]. 

In the following, we provide an example where angle-resolved photoemission 
spectroscopy (ARPES) measurements have allowed to understand the spin-charge 
conversion from the o-Sn TI [see Fig.5.14a]. In this spin-charge conversion study, 
the spin current is generated through the magnetization dynamics induced at the 
magnetization resonance of a Fe layer by an external rf field. The generated spin 
current then diffuses to the w-Sn top surface. Injection of a spin current into a TI 
induces a spin accumulation on one side of the Fermi contour of the Dirac cone as 
well as a spin depletion on the other side [see Fig. 5.14b]. As a consequence, this spin 
injection results in a charge current. Importantly, note that the spin-charge conversion 
is not observed when Fe is deposited directly on o-Sn, but is observed when a thin Ag 
layeris inserted at the interface [see Fig. 5.14c]. By performing ARPES measurement 
to probe the DOS, it has been shown that the deposition of a sub-monolayer of Fe 
on a-Sn indeed suppresses the Dirac cone, that is a signature of the TI, whereas it 
is still observable after deposition of a Ag layer [see Fig. 5.14d]. Hence the absence 
of spin-charge conversion at Fe/o--Sn interfaces is clearly ascribed to the loss of TI 
surface states after the Fe deposition. This study shows that characterization of the 
DOS by ARPES measurements is a very useful and unique technique to understand 
spin-charge conversion at such spinorbitronic interfaces. 


5.2.2.2  Heat-Spin Conversion 


Similarly to the Seebeck effect, the spin Seebeck effect describes the generation of a 
spin voltage from a temperature gradient in a FM conductor or insulator. Longitudinal 
spin Seebeck effect (LSSE) refers to experiments where the spin current generated 
is parallel to the temperature gradient. Materials used are FM materials (conducting 
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Fig. 5.14 a Scheme of the device studied for the spin to charge conversion by spin pumping into 
the topological insulator o-Sn. b Illustration of the inverse Edelstein effect. A spin current injected 
in the topological insulator induces an accumulation of charge for one spin direction inducing a shift 
of the Fermi contour creating a charge current. c Ferromagnetic resonance and dc charge current 
signals for a-Sn/Fe/Au and a-Sn/Ag/Fe/Au samples. Only the second sample shows a dc current 
signal. d ARPES measurement along the [100] or [110] directions on the free surface of o-Sn (top) 
and after deposition of Fe (left) or Ag (right). The Dirac cone subsists after deposition of Ag but 
not for Fe. Adapted from [45] with permission (Copyright 2016, American Physical Society) 


or insulating) in contact with heavy metals. In order to extract LSSE efficiency, it is 
necessary to discard the possible artefacts such as an anomalous Nernst effect that 
arises from the potential induced magnetization in the NM layer [46]. 

Prototype material systems contain a thin film of yttrium iron garnet Y3FesO;; 
(YIG) as a FM insulator and a thin layer of Pt as high spin-orbit non-FM mate- 
rial. In order to detect and quantify the magnetic proximity effect (MPE) at the 
YIG/Pt interface, XMCD measurements appear to be the most accurate method. In 
Fig. 5.15, XMCD measurements at the Pt L» 5 edges performed on YIG/Pt samples 
are shown. Whereas no MPE have been detected for one study [see Fig.5.15a] [47, 
48], a clear XMCD signal has been observed for an another experiment [Fig. 5.15b] 
[49]. This discrepancy can be attributed to different interface qualities. Actually, x- 
ray absorption near-edge spectra (XANES) have clearly different shapes. The white 
line intensity (XANES step height) depends on the number of holes in the 5d band 
and is, therefore, sensitive to the oxidation state of Pt. In the case where no MPE has 
been detected, the white line intensity is about 1.3 which is similar to the Pt metal 
value. On the contrary, when MPE is observed, the white line intensity is about 1.45 
which is close to what is observed for PtO; 36 samples [50]. These observations mean 
that the observed MPE is probably due to intermixing at the YIG/Pt interface. 
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Fig. 5.15 XANES and XMCD recorded at the Pt L2,3 edges for YIG/Pt samples. For apparently 
similar samples a clear XMCD signal is observed for one sample (right), whereas no induced 
magnetic moment is measured for the other (left). The higher intensity of the step edge observed 
for the sample exhibiting an XMCD signal (right) meaning an increase of the number of holes in 
the 5d band indicates a possible Pt oxidation. Hence different interface qualities induce different 
XMCD signals. Reproduced from [48] (left) and from [49] (right) with permission (Copyright 2013, 
American Physical Society) 


This behaviour has been clearly evidenced by further experiments on insulating 
ferrite/Pt samples. In Fig. 5.16 (left panel), XMCD measurements performed at the Pt 
M3 edge on CoFe,0,/Pt samples are presented [51]. A clear XMCD is observed for 
samples where Pt has been deposited at high temperature (HT in Fig. 5.16), whereas 
no XMCD signal is detected when Pt is grown at room temperature (RT in Fig. 5.16). 
By performing XAS and XMCD measurements at Co and Fe L2 3 edges [see Fig. 5.16 
(right panel)] it appears clearly that the deposition of Pt at high temperature induces 
some intermixing at the CoFe;O4/Pt interface. Indeed, Co XAS spectra are close to 
those of metallic Co films for the sample with Pt grown at high temperature, whereas 
for the sample with Pt deposited at room temperature, spectra shapes are similar to 
those of CoFe20; thin films. Hence the presence of MPE in ferrite/Pt interface comes 
from interface alloying, whereas for clean interfaces, the induced magnetization by 
proximity effect is absent. In fact, XAS is not the only measurement to detect MPE; 
X-ray resonant magnetic reflectivity is also a powerful technique. Thanks to the 
latter, T. Kuschel et al. have confirmed that no MPE occurs at the clean NiFe,O,/Pt 
interface [52]. Finally, it has been shown that no MPE occurs for another ferrite 
(MnFe20,/Pt) and for magnetite, both in the conducting and insulating states [53]. 
This suggests that the absence of MPE at the magnetic oxide/Pt clean interface is a 
general rule. 
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Fig. 5.16 XAS and XMCD spectra recorded at the Pt M3 edge for CoFe?O4/Pt where Pt is grown 
at room temperature (RT) and high temperature (HT). A clear XMCD signal is observed at the Pt 
M3 edge when Pt is grown at high temperature. A clear difference can be also seen in the Fe and Co 
L2,3 edges. For Pt grown at HT Co L»,5, spectra look like Co metal signifying an intermixing at the 
CoFe204/Pt interface. When Pt is grown at RT, Co and Fe L2,3 XAS spectra are similar to those 
of CoFe2034 thin films. Adapted from [51] with permission (Copyright 2018, American Chemical 
Society) 


These experiments hence highlight that synchrotron radiation-based spectro- 
scopies are powerful to probe the electronic and magnetic properties of interfaces 
and can be very useful to discard artefacts in LSSE experiments. 


5.2.3 Current-Induced Magnetization Dynamics 


Besides STT-MRAMs, other conceptual memory devices have been proposed and 
largely studied in the decade. A flagship example is the domain racetrack memory, 
proposed by S. S. P. Parkin [54], which is a nanoscale shift register memory in which 
bits are defined by magnetic domains separated by domain walls. Another version 
of such devices has been recently introduced where the bits are stored thanks to 
the presence of magnetic skyrmions [55]. In this section, we show that magnetiza- 
tion dynamics imaging using synchrotron radiation-based measurements that can be 
useful to better understand these STT-MRAM and racetrack memory devices. 


5.2.3.1 Spin-Orbit Torque Driven Magnetization Reversal 


SOT is an efficient tool to manipulate magnetization in spintronic devices integrating 
only one FM electrode (see, for example, R. Ramaswamy et al. for a recent review 
[56]). M. Baumgartner et al. [57] have studied Co nanodot magnetization reversal 
by SOT. In Fig. 5.17a, the magnetization reversal [measured by XMCD at the Co L5 
edge (black)] induced by a current pulse (red) is shown. Then, by performing time- 
resolved scanning transmission X-ray microscopy (STXM) measurements, it has 
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Fig. 5.17 a Magnetization reversal probed by XMCD measurements at Co L3 edge (black) induced 
by a current pulse (red). b STXM images recorded at intervals of 100 ps during a2 ns injected current 
pulse. The four rows correspond to different applied current (yellow arrows) and magnetic field (blue 
arrows) conditions. Red dots indicate domain wall nucleation and green arrows the direction of the 
domain wall propagation. Adapted from [57] with permission (Copyright 2017, Nature Publishing 
Group) 


been possible to perform a direct observation of the actual path leading to the mag- 
netization reversal during the current pulse injection. Figure 5.17b represents STXM 
images recorded at the Co L3 edge with 25 nm spatial resolution at intervals of 100 ps 
during a 2 ns current pulse. From these images, nucleation (red dot) and propagation 
(green arrow) of magnetic domain walls can be clearly observed. Depending on the 
applied magnetic field direction and the current polarity, the main characteristics of 
these nucleation and propagation might change. These measurements have allowed 
demonstrating that this diagonal motion of the domain wall originates from a com- 
bination of the damping-like and field-like SOTs and the Dzyaloshinskii-Moriya 
interaction. 


5.2.3.2 Current-Induced Domain Walls and Skyrmions Motion 


Racetrack memory-based on domain wall or skyrmion motion is an attractive field for 
future magnetic memories. Synchrotron radiation-based techniques such as PEEM 
and STXM or even X-ray magnetic resonant scattering (XMRS) have allowed to 
study the fine structures of magnetic textures such as chiral domain walls [58], vortex 
[59] and skyrmions [60, 61], see, for example, X. Cheng et al. for a review [62]. A 
key point is also to optimize the motion velocity and determine the factors that could 
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Fig. 5.18 (left panel) XMCD images of Co and NiFe electrodes of a NiFe/Cu/Co spin valves 
recorded at the Co L3 edge a and Fe L3 edge b, c. Different magnetic domains are observed 
depending on the parallel b or antiparallel c magnetic configuration (right panel). d Sketch of the 
Pt/Co/Ta stripe deposited on a Si3N4 membrane. e STXM images recorded after current pulses 
showing that three of the four skyrmions move (orange, yellow and red circles) after the current 
pulse. The fourth skyrmion (white circle) is pinned. f Skyrmion velocity as a function current density 
injected for Pt/Co/Ta and Pt/CoFeB/MgO devices. from [63] (left panel) with permission (Copyright 
2010, American Physical Society) and from [29] (right panel) with permission (Copyright 2016, 
Nature Publishing Group) 


limit this velocity. X-ray imaging is a powerful tool to investigate magnetic object 
motion induced by a current. In the following, we show two examples of domain 
wall and skyrmion motions probed by XMCD-PEEM and STXM. 

Large domain wall velocities (~600ms~') have been measured in NiFe/Cu/Co 
spin valves [63]. Unexpectedly, the domain wall motion is altered when longer electri- 
cal pulses or higher current densities are applied [63]. By performing XMCD-PEEM 
measurements, it has been shown that the dipolar interaction between the NiFe and 
Co electrodes was a source of domain wall pinning. Thanks to the chemical selectiv- 
ity of X-ray photoemission, magnetic configuration of the NiFe and Co electrodes 
can be probed by recording images at the Fe and Co L3 edges [see Fig.5.18a, c]. 
In the parallel magnetic configuration, the stray field of the Co domain wall locally 
reverses the magnetization in the NiFe layer, leading to the three domain walls [white 
circle in Fig. 5.18b]. On the other hand, in the antiparallel magnetic configuration, 
the magnetic flux closes naturally and a single domain wall is formed [Fig.5.18c]. 
This magnetic imaging of the Co and NiFe electrodes demonstrates that the stray 
field prevents domain wall motion across the corners in the NiFe layer. 

An important property of the racetrack memories is that the magnetic objects 
(domain walls or skyrmions) move all together. In Fig. 5.18(d-f), we show that STXM 
measurements can be used to probe current-driven skyrmion motion in a Pt/Co/Ta 
stripe as shown by K. Woo et al. [29]. Each image is recorded after injection of 
current pulses. Three of the four skyrmions (red, yellow and orange circles) move 
forward and backward depending on the current polarity [see Fig. 5.18e]. Note that 
the fourth skyrmion (white circle) is not showing any motion under current injection 
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(for both polarities) and is probably pinned by a material defect. Skyrmion velocities 
can be extracted [see Fig. 5.18f] and it is shown that they increase from —50ms ! 
to —-120 ms^! by replacing Co by CoFeB. These measurements allow to conclude 
that lower pinning materials such as amorphous CoFeB layer, i.e. without grain 
boundaries, are probably more appropriate to get an efficient skyrmion motion. 

X-ray microscopy is thus a powerful tool to study in real time the current-driven 
magnetization dynamics in various magnetic systems. We have shown three exam- 
ples where magnetic reversal, domain walls and skyrmion motion are investigated 
by STXM or XMCD-PEEM. This is not limited to FM systems as antiferromagnetic 
domains can be also probed by performing linear dichroism. Recently, M. J. Grzy- 
bowski et al. have measured current-induced antiferromagnetic domain switching in 
CuMnAs by performing XMLD-PEEM measurements [64]. 


5.3 Conclusion 


We have shown through few examples that synchrotron radiation-based spectro- 
scopies are powerful techniques to perform advanced characterization of various 
spintronic systems. For example, XAS (including circular and linear dichroism) 
allows probing the electronic and magnetic properties of surfaces, interfaces and 
bulk materials. Spin- and angle-resolved photoemission is an ideal tool to measure 
the actual spin polarization and reveal the density of states of surfaces and inter- 
faces. It can be applied to a wide variety of materials used in spintronic devices 
such as ferromagnets, antiferromagnets, topological insulators, Rashba interfaces, 
hybrid ferromagnet/molecules interfaces, 2D materials and multiferroics. XPEEM 
and STXM are also perfect techniques to study the dynamics of magnetic structures 
(ferromagnetic, ferrimagnetic and antiferromagnetic) but also to measure locally 
the electronic structure of surfaces. This is not exhaustive; other techniques such 
as X-ray magnetic scattering, resonant inelastic X-ray scattering, hard X-ray pho- 
toelectron spectroscopy, nano-ARPES can be also used to probe spintronic device 
properties. Finally, the development of new techniques such as magnetic X-ray nan- 
otomography [65] and novel X-ray sources (new generation of synchrotron and X- 
ray free-electron lasers) improving coherence, spatial and temporal resolution, will 
allow obtaining deeper characterization and understanding of magnetic textures and 
spintronic devices. 
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Chapter 6 A) 
p-Wave Superconductivity and d-Vector get 
Representation 


Jean-Pascal Brison 


Abstract Since the mid-80s, new classes of superconductors have been discovered 
in which the origin of superconductivity cannot be attributed to the electron-ion 
interactions at the heart of conventional superconductivity. Most of these uncon- 
ventional superconductors are strongly correlated electron systems, and identifying 
(or even more difficult, predicting) the precise superconducting state has been, and 
sometimes remains, an actual challenge. However, in most cases, it has been demon- 
strated that in these materials the spin state of the Cooper pairs is a singlet state, 
often associated with a “d-wave’ or ‘s + /—’ orbital state. For a few systems, a spin- 
triplet state is strongly suspected, like in superfluid ?He; this leads to a much more 
complex superconducting order parameter. This was long supposed to be the case for 
the d-electron system Sr; RuO,, and is very likely realized in some uranium-based 
( f-electron) ‘heavy fermions’ like UPt3; (with multiple superconducting phases) or 
UGe» (with coexisting ferromagnetic order). Beyond the interest for these materials, 
p-wave superconductivity is presently quite fashionable for its topological properties 
and the prediction that it could host Majorana-like low energy excitations, seen as 
a route towards robust (topologically protected) qubits. The aim of these notes is to 
make students and experimentalists more familiar with the d-vector representation 
used to describe p-wave (spin triplet) superconductivity. The interest of this formal- 
ism will be illustrated on some systems where p-wave superconductivity is the prime 
suspect. 


6.1 Introduction 


The purpose of these notes is only to cover some aspects of spin-triplet supercon- 
ductors, not so commonly covered in the excellent textbooks available on super- 
conductivity, in general, and unconventional superconductors, in particular. Among 
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those, let us choose to quote only two: the seminal Reviews of Modern Physics 
paper “A theoretical description of the new phases of liquid ?He" by A. J. Leggett 
[1], which gives both very advanced and detailed insights on the theory of the p- 
wave order parameter of superfluid ?He, and pedagogical and enlightening treatment 
of the microscopic Bardeen-Cooper-Schrieffer (BCS) theory of anisotropic super- 
conductors and the other, which covers the very important symmetry aspects of 
unconventional superconductors in crystalline materials, is the book ‘Introduction to 
unconventional superconductivity' by V. P. Mineev and K. V. Samokhin [2]. 

In the following, we concentrate on some basic aspects of the description of spin- 
triplet superconductors, which are often bewildering, at least to experimentalists. 


6.2 Odd-Parity Pairing: BCS Wave Function and Order 
Parameter 


Most known superconductors are ‘spin-singlet’ superconductors, meaning that the 
relative wave function of the Cooper pairs |Y (ry — r2)), in the real or in the reciprocal 
space, can be written as a product of an orbital wave function and a spin (singlet) 
wave function 


Wry —r)- prr tN, 


(6.1) 
[Y (k)) PIT — 41). 


Antisymmetrization of the total pair wave function imposes, for such a singlet state, 
that the orbital wave function verifies $ (r1 — r2) = $ (r2 — r1) or (k) = q(—k) 
(even-parity state). However, it is also possible to build Cooper pairs in a triplet spin 
state (see Fig. 6.1). If all electronic interactions including the pairing interactions 
conserve spin, one could pair separately up- and down-spins, and the total super- 
conducting wave function with a triplet spin state would be the (antisymmetrized) 
product of both. However, if any non-spin conserving term exists, like the spin—orbit 
interaction, this is no longer possible. One can just say that Cooper pairs will be 
formed with a wave function of the form 


IW) = $i(1— r2) 11) + 621 — r2| 44) + $1204 r2] TL + Jt). (62) 


or in the reciprocal space 


IY) = en(k)I 11) + 9209] 44) + v1509] 11 + 10) . (6.3) 


Antisymmetrization of the total pair wave function imposes this time that the orbital 
wave function $(r; — r2) = —$ (r2 — r1) or y(k) = —e(—k) (odd-parity state). 
Note that microscopically, one would write the ground state superconducting wave 
function for the whole Fermi sea as 
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Fig. 6.1 Singlet a versus triplet b Cooper pairs: they are built with quasiparticles of opposite wave 
vectors in both cases, but differ by their spin state 


[v) = | [enn + VEE EC kp | + Ue C Oe + Vit Ce C )10 > 


all k 
= I] (Uk, Vk ob Chel Kg) |0 > 
all k,a,B 
(6.4) 
with Uk ag = U-k,af 3 Ukap = —U-kof >» 


and — @yg(K) = (c-kgCko) = Ukop Vkop = —@upg(—k) the order parameter [1] . 


The last condition on the parity of uy and vy for the same spin indices ensures 
that the orbital part is odd (for the exchange of k and —k), selecting only triplet 
spin components. Coming back to the order parameter, in the reciprocal space, it 
should be given by three complex odd functions of k: 911, #22 and qj» = 9»;. The 
most natural would be to view the order parameter as a 2 x 2 symmetrical matrix 
Pap, Where o and f are spin indices (1 =f, 2 =|). This is possible, and is used in 
many calculations. However, it is not very convenient if one needs to change the 
quantization axis or if (as it commonly happens) the quantization axis changes over 
the Fermi surface. There are only three independent complex functions of k, so it 
would be nice to represent the order parameter by a vector. 


6.3 Vectors and Cayley-Klein Representation 


6.3.1 Position of the Problem 


However, this would be meaningful only if this vector transforms properly under 
rotation of the spin quantization axis. And one would also expect its magnitude to 
be proportional to the density of condensed Cooper pairs, and its direction to have 
a meaning relative to the spin orientation. This last point is clearly not so direct, as 
the vector will necessarily be complex. In order to understand more clearly what is 
necessary, let us first explore what doesn't work. We could build simply such a vector 
representation through: 


V = qiue + 95e, + P12 . (6.5) 
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But it would not do the job: the module would be fine, but the direction of V and so 
its transformation under rotations of the axis would be meaningless, for example, for 
the same quantization axis a pure | ++) or | ||) state would lead to perpendicular 
vectors. Or equivalently, taking an opposite direction of the quantization axis would 
yield perpendicular vector representations. This is clearly not what is expected from 
a vector behaviour. The problem stems from the fact that one needs to make a link 
between the spin state [SU(2)] and three-dimensional vectors. The good news is 
that this problem has been solved long ago in classical mechanics, with the Cayley— 
Klein representation, which aimed at simplifying the calculation of rotation effects; 
in real space, a matrix rotation is a 3 x 3 matrix; however, it is fully characterized 
by only three angles (the Euler angles for example); so, in principle, a 2 x 2 matrix, 
with four parameters, should be more than enough. The Cayley- Klein representation 
associates a three-dimensional vector (a) to a 2 x 2 matrix through ... Pauli matrices 


a 20, 


0 = Oj,€y + 02e, + 03€; , 


01 0 —i 10 
ELE n=o,=(' 2) a-«-(5)- (6.6) 
cr 63; — Ójj — 189; UNE a a — ima 
Ti = (bu t ido a ao = aioi = Naa tin vds i 


where 6;; is the Kronecker symbol. 


6.3.2 Useful Formula for Pauli Matrices 


As a reminder, for these (Hermitian) Pauli matrices 


oj 21 ; [onoj] = 2i or ; {0i 0j} = 281, 
ojoj = icto, + 51, (6.7) 


tr(o;) = 0; det(o;) = —1; eigenvalues = +1 , 


where €"/* is the Levi-Civita symbol. 
From that, a little algebra leads to very useful formulae (a and b are real or complex 
3D vectors) 


(a-o )og = (ajoj;)oy = ajc"; + aiik l = sie ajo; tad. 


So 
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(a-0)0 =al—iaAo ; o(a:.0)—al-Fia^o ; 
tr((a-0)0) —2a; (6.8) 
(a-o)(b-o) = (a-b)1 +i(aAb)-o. 


Finally, if a is real, or if at least one can write a = a - à, with a, a complex number, 
and â, a real unit vector, then additional useful relations exist 


e the eigenvalues of a - ø are +a; 

e the projectors on each eigenspace can be written as 1 (1 B a-o); 

e for any analytic function 
f (ao) = © (1+ 4-0) + $9 (1 — 4-0) and 

e in particular, if €? is a real vector, also written as € = Q Ê, Q, a real number, and 
Ê, a real unit vector 


tees exp (i22) + exp (—iQ2) 14 =D (£2) — ap (—i1Q) ba 
2 2i 


, 


(6.9) 


exp (19-0) = cos Q 1+ isin €2 Q-o |. 


6.3.3 Rotation of a 3D Vector: Cayley-Klein Relation 


From these relations, it is straightforward to see (proof at the end of the chapter) that 
if R is a 3D rotation characterized by an angle Q around the axis €, for any vector a 
R(a)-o = exp (—i1/20-0) (a- o) exp (/29-0) , 


(6.10) 
R(a).0 = Ro(a-0)R o |, 


where Rg = exp (3 2-0) is the rotation matrix around & for a spin 1/2. So one can 
work with 2D (complex) matrices to calculate the effect of a 3D rotation R on a real 
vector a. 

In fact, this is more general in the sense that it is also true when applied on complex 
vectors (rotating around a real vector @ ). Indeed, the effect of a 3D rotation of angle 
Q around the axis € on a real vector a can be easily expressed through the relations 
(see Fig. 6.2) 


a—(a-)9--a— (a 9) , 


Ra) = (a. 9) + cosQ (a (ae $$) +sinQ(Ê ^a) |. (6.11) 
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Fig. 6.2 Decomposition of a 
vector for the calculation of 
its rotation by an angle Q 
around Ê 


And (6.11) can be used to define what is the rotation of a complex vector around a 
real vector €. With such a definition, the Cayley-Klein relation (6.10) also works 
when a is a complex 3D vector (see ‘proof’ in Sect. 6.11). 


Exercise 6.1 Show that with the definition of the rotation (6.11) of a complex vector 
(around a ‘real vector €^), the scalar product and the cross product are conserved 
under rotation: 
R(d)- Ru) 2d-u, 
(6.12) 
Ru) A Rd) 2 RUA). 


Solution in Sect. 6.11. 


6.4 d-Vector Representation 


Coming back to the problem of finding a vector representation of the order parameter, 
if we could cast the 2 x 2 matrix order parameter (g) in the form (a-o), there are 
good chances that the vector (a) would do the job. Working in the reciprocal space 
(k = kpn), where kp is the radius of the Fermi surface, we start from 


Iud) = 3 Gap (fi) |B) 
m (6.13) 


— ANH] tt) + A* | 44) + A? f +141), 


— Paa PBa : _ = At A9 
(y) = is e with qus — Ppa or (P) = ( Aba]: 619 
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Comparison with expression (6.6) for a-o doesn't fit, notably as regards the non- 
diagonal symmetric terms. This is due to the o? component of a-o . It can be eliminated 


if one calculates 
i(a o) 02 i A : * (6.15) 
a + 142 —a3 1 0 


This allows for a straightforward identification of a vector representation of the 
order parameter, noted as d, of components 


Pop =  G(d.o):o;)gg > (Y)=1d-0)-o. (6.16) 


Note that, traditionally, d is normalized to 1 (in a sense to be precised later), 
like a wave function, whereas the order parameter amplitude reflects the 'superfluid 
density' and is proportional to the gap in the simplest cases. The equations above do 
not reflect this subtlety that will be precised later on (see Sect. 6.6.1). Therefore, to 
be ‘in line’ with the convention of most papers on the subject, we will introduce a 
(k-independent) proportionality factor y 


gu= At= y (-d, + idy) Wde= F(-AT+ A= $F (-g11 + 9») 
gn= A= Wdy +idy) } & į Wdy=—3( AT+A=—3( G+ 9) |. (6.17) 
p2 = = A°= Wd.) yd.— AD = 4( g2t+¢r1) 


And convenient expressions for calculations deduced from (6.16) and (6.8) read: 


3 
IW) = Do guplaB) — iV. 3 ^ di(oior)epleB) , 
a,b 


aß,i=1 
Iw) = iy $ Bd- o)oa)lap) | (6.18) 
aß 
pado) = -ieoa = |d- Zr ooo) : oo 
° =— Nor = —tr 0: == (oy aB > 
q).02 2j Q)to2 2y - 20 Jap (a. B 


d= gj I-A Gs + iky) + A‘ (i) (Ky — iky) + 2A?k;] |. (6.19) 
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6.5 Behaviour under Rotations 


6.5.1 Rotation in Spin Space 


For d to be a true vector, it should behave appropriately under rotation. d is rep- 
resenting an order parameter which has both orbital and spin degrees of freedom, 
but the specificity of odd-parity pairing, leading to the necessity of such a vector 
representation, is coming from the spin degree of freedom. With the relationship 
to the Cayley-Klein representation, one should expect that this choice leads to a 
relationship between rotation in spin space and rotation of d. In fact, the effect of 
rotations can be calculated both directly and with the generator of rotations. Let's do 
both methods. 

For the direct evaluation, the important point is that the rotation acts simultane- 
ously on both spins. Starting from the expression (6.18) to evaluate the effect of the 
rotation on the spin part of the order parameter, we get 


Ral¥) = iy Y "(BI(d - o)or\a)Ri.2 8 Ra glap) 


ap 

= iy Y (5|RalB)(B\(d - o)o2|x)(y|Rela)|y5) 
apy 

= iy Y 6IRo(d - o)o:]o) (y lo} - Ralo)|y5) . 
ays 


We have 


(vla? - Rala) = Y` (v lozin) (n| cos(Q/2)op — isin(Q/2)or®.|a) 
n 


= Clos» tel — cos(Q/2)os — isin(Q/2)o5Q.a |n) 
n 
= (alo2&-oo»|y) . 


as o» is antisymmetric and o5(&).0) is symmetric. So 


Ra|W) = iy 3 (8/Ro(d - o)ozla) (|o R ocs|y)1y) 


ays 


=iv ) GIRa(d * 6) R -go»ly)|yà) 
yó 


= iy $ (8|(R@)-0) osly)yà) . 


yó 


Using (6.10) 
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Re|W) = |Y (R(d))) . 


So indeed, the effect of a change of the spin quantization axis on the order parameter 
can be evaluated directly by the corresponding rotation of the (complex) d-vector 
in 3D. And the calculation above makes a direct connection between the Cayley— 
Klein transformation and the rather involved definition of the d-vector. 

Itis also useful (and simple) to evaluate the effect of a rotation using the generator 
of rotations in spin space: this generator is simply —if.S, where the total spin 
S=S8,@1+1@S8,. 

The effect of any operator O = O; & 1+ 1 & O; acting in the spin space can be 
calculated as (remembering that (d-o )o2 is a symmetric matrix and o» an antisym- 
metric matrix) 


OY) = iy 3 (BIG - a)o»|o) Ola) 
ap 


= iy $ (Id - o)o31o) (y|Ola)|yB) + (^101B)lory)) 


apy 


(6.20) 
=ip Y (BI0(d - o)os|o) (luf) + |Be)) or 
ap 


= iy $` (810(d - o)o|a) + (o|O(d - a)o2|8)) lo) . 
ap 


Applying (6.20) to the action of generator of rotations in spin space, namely, — if-S, 
we get 


-;&s) = i; 26r : e )(d - 0)o»|a) (lag) + |Ba)) 


= 5 J (Blas: i(à ^ doola) (laf) + (Bo) . 
op 


So i 
-=ñ.S 1Y) = [YÂ ^d). (6.21) 


So that applying a rotation in spin space amounts to the same rotation of the d-vector 
[see (6.11)] for an elemental rotation of d: Rg,s|¥) = |Y (Re(d))) (see [3]). 


6.5.2 Rotation in Real Space 


For rotations in real space (on the orbital degrees of freedom), we should calculate 
the effect of —;i.L, with L = r^ (2v,). Writing D(k) = [d(k)-o Joo, we get 
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-zL|)- - (r^ vo fake 3 (BID (k) |e) lop) 
ap 


=y » f dk i(r ^ k)e "*" (B| D(k) |) |B) 
=y 2. f dk (8D (k)læ) (k ^ Vie" |B) 
--y Sad — ((B|D(k)|a)k) ^ k; e™" ap) 
= -y Y. f axe-*riic A V)DW lalap) , 

ap 


SO 
i 1 
ALI) = |w(—if-Lyd(k))), with Ly =k A -Vp . (6.22) 
1 


This last expression shows that a rotation in real space acts, as it should, on the order 
parameter according to its orbital state: p-wave, f-wave, ... for a triplet supercon- 
ductor, transposed as usual in the reciprocal space. 


6.5.3 Change of Quantization Axis: Application to ESP 
States 


In order to get more familiar with rotations of the d-vector, let us start with an 
exercise: 


Exercise 1 Consider the very first example of Sect. 6.3.1 to observe the fate of the 
d-vector under a change of orientation of the quantization axis on a simple | 77) 
state. Solution in Sect. 6.11. 


Beyond this ‘trivial’ example, understanding the behaviour under rotation of the 
d-vector is particularly useful to get a more precise idea about some specific spin 
states. For example, we can easily understand that any state |W) = Aol tl + 11) 
can be considered as an ‘equal spin pairing’ (ESP) state, with equal weight on | 7 7) 
and | ||) spin components. Indeed, its d-vector is simply 


1 0 
d=—y] 0 
V V Ao 


Let us rotate the quantization axis by —7r/2 around an axis Q in the x- y-plane with 
an angle ọ from the x-axis. To get the coordinates of d in the new frame, we should 
rotate it by 77/2 around Q [remember (6.11)] 
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. COS o 
Q= | sino |, 
0 
. sin g ^o 
Rg(d) = sin(z/2& ^d = z —cosgAg] , 
0 


Ay = V (—d, + idy) = —ie^ Ag ; Ay v(d, tidy) = —ie™ Ag, 


which is indeed an ESP state with only 4^7 and {J spin components. It is a good 
exercise to check that, reciprocally, any ESP state with equal weight for the up- and 
down-spin components can also be written as a pure S$, = 0 state for some choice of 
the quantization axis. 


Exercise 6.2. Show that any ESP state with equal weight for the up- and down-spin 
component can also be written as a pure |S, = 0) state. Solution in Sect. 6.11. 


6.6 Some Uses of the d-Vector Representation 


6.6.1 Amplitude of the d-Vector 


As promised, let us say a few words on the question of normalization of the d-vector. 
For s-wave superconductors, in the simplest cases, we know that the order parameter 
can be taken as proportional to the gap. Of course, this is wrong in the general 
case, e.g. gapless superconductivity exists (induced by a critical amount of magnetic 
impurities for example). But the idea is that |y |? somehow represents the superfluid 
density. For a spin-triplet superconductor, we can define this quantity as 


d2 : dR MT 
(YY) = f a. D Fines = f a, 60e 8) 
2 dQ " 2 dQ -— 
= qvi je -e)(d + 0)o3) = 2v ESTE 
T An 


Note that the definition above is coherent with the fact that from the very beginning, 
we did not normalize (by v5) the | ^^ + 44) component of |y) in (6.3). |d(&) |? can 
be interpreted as the angular-dependent superconducting (or superfluid) density, and 
by convention, one takes 


d2 suos 
f Sa E (6.23) 


So on calculating averaged quantities (Y|O|W)/(W|W), one should remember that 
(vv) = 2|yp. 
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6.6.2 Spin Direction 


Up to now, we discussed the properties of the d-vector under rotation but we did not 
unveil the signification of its direction. As announced, it cannot be straightforward, 
as in the general case, d is a complex vector. But it should be related to the spin. So 
let us calculate S| v), in the same way we performed the calculation of the effects of 
rotations in spin space, using the generator [Sect. 6.5.1, (6.21)] 


SIY) = 2 Y velo (d -o)ozla) (laf) + 1o) 
aß 


= Ê Y pleg- (d A ooo?) aff) + |80) , 
ap 


from which we deduce immediately that 
d-S|v) =0. (6.24) 


This means that if d is real (up to a phase factor), it is perpendicular to the direction 
of the Cooper pairs spin (quantization axis). More explicitly the average spin at a 
given wave vector k of the Fermi surface can be calculated as 


emp 7 52,2, 0e - eI ^ eso) (yel (lap) + lf) 


y». a 


=F rd sod m) = 5 (AAA oe) 


7 (tr ((d ^ d*)1) — i tr (d ^ (d* ^ @))) 


= ih(d A d*) - 4 tr (dod -o)) 


(S(k)) = ifd(k) ^ d*(k) |. 


(6.25) 


6.6.3 Non-unitary States 


The above equation is important. Indeed, if d(k) ^ d*(k) is non-zero, the state is 
called a ‘non-unitary state’ and it has some more involved properties. Moreover, 
in general, the fact that d(k) ^ d*(k) is non-zero means that locally, on the Fermi 
surface, the Cooper pairs spin is non-zero. But it does not mean that globally, the 
superconductor is spin-polarized. Conversely, if the superconductor is globally spin- 
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polarized, it is necessarily in a non-unitary state, where d* is not proportional to d, 
see Sect. 6.9 on ferromagnetic superconductors. 

This notion of ‘non-unitary’ state is usually bewildering, and it is useful to make 
some simple calculations in order to get more used to it. For example, we can check 
what are the conditions under which an ESP state can also be non-unitary. An ESP 
state has only A4 and A, components, so that its d-vector will be of the form 


I(-A! + A) 
0 
Then 
d ^ d* = (d.d; — d,d?)e; 
: 2 
2 w [(—At + AAT + AM) + (AT + AY)(—A™ + Ae 
-iyp? 


[ATP — |A*["]e; . 


2 
From (6.23), we get that V (JA! [2 + |A! 2) = 1. So 


[Af]? Aer 


ihd ^ d* = A 
i [At + TATE 


ez. (6.27) 


The conclusion is simple: an ESP state is non-unitary only if the amplitude of the 
A4 and A, components is different on some part of the Fermi surface. 


6.6.4 Orbital Moment 


In the same way, from (6.22), we calculate that the average orbital moment per 
Cooper pair is [1] 


Y. (vlos(d* + 0)18) (BIKA Vi) (dK) - o)ozla) (vólo/B) 


(W|L|W) -í2 h 
a,B,y,6 


(|) 4x 2i 


dQ h è 
= f SF tr (E o) A VAW +0), 


ñ (d2, 
(Ld) = = f 45 2. di ^ Vidi (4) |. (6.28) 
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Note that if d is real, (L) is zero (if not, it would be imaginary!). This appears in 
(6.28) from 


dQ x 4. aad i? 2 s 
fiis (k ^ Vi)d;(k) =$ = D T (k) 
= D d?(k) 20 
=- P l^ede- . 


A last remark on this point: superconductors for which (L) is non-zero are nowadays 
called ‘chiral superconductors’ and quite looked-after for their potential topological 
properties [4]. Note, however, that if only triplet superconductors can have a non- 
zero (S), this is not the case for (L): both spin-singlet and spin-triplet can be chiral. 
Naturally, in case of spin-singlet, the superconductor needs to be unconventional (not 
s-wave), and intrinsically complex, so that (L) can be non-zero. This is the case, for 
example, of d-wave superconductors of type “d + id" or (ky + iky)k,... 


6.6.5 Excitation Energy of Quasiparticles 


We will not derive the energy spectrum from microscopic theory, just report the results 
(see [2] for example): for triplet superconductors in a unitary state, the expression 
of the energy of elementary excitations is very similar to that of singlet anisotropic 
superconductors, with the k dependence ofthe energy gap controlled by the amplitude 
of d(k) 


E, = Je + A? (|d(k)|?) . (6.29) 


However, for non-unitary states, two branches appear in the spectrum, depending 
on the spin orientation of the excitations with respect to (S): it is as if they are 
‘Zeeman split’ by (S). So the energy gap is expressed in such a case as 


E; = Jg + A? (|d(k)|? + |d(k) ^ d*(k)|) . (6.30) 


Hence, this is another true difference with respect to singlet superconductors. 

We can also see how both (6.29) and (6.30) read when using not the d-vector 
notation, but expression like (6.17) for the order parameter. In the unitary case, the 
gap A (k) would be expressed as 


1 
A(k) — i (IA* GP? + [AY (|?) + A? OP? . 


And in the case of non-unitary states, if we take the ‘simple’ example of ESP states, 
using expression (6.27) for d ^ d* we derive easily that 
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A(k) = NS (Ida)? + |d(k) ^ d*(k)]) 
—|A'() or JA*(K)|. 


(6.31) 


This last expression shows concretely why 'non-unitary states' are a distinctive fea- 
ture of spin-triplet superconductors. It can be also anticipated that this expression will 
be particularly useful for ferromagnetic superconductors, where band polarization 
can lead to a large difference between | A‘ (k)| and | A‘ (K)| (see Sect. 6.9). Expres- 
sion (6.30) gives a general formula for the two gap values of a non-unitary state, even 
if it is not an ESP state: as will be seen later, UPt; in its B-phase could produce such 
a case (see Sect. 6.8.2.3). 


6.7 TheSpin-Orbit Issue 


Before discussing some emblematic examples of p-wave superconductors, let us say 
a few words concerning the question of spin—orbit coupling. Indeed, when discussed 
for real materials (except for superfluid ?He), it covers two different aspects which 
should be distinguished to avoid confusion. The first is the usual spin-orbit coupling 
at the atomic scale, discussed already in the normal phase as it prevents the spin 
S to be a good quantum number. In a solid, symmetries can help to overcome this 
problem: 


e If the system has an inversion centre and time-reversal symmetry, quasiparticles 
with a given wave vector k are necessarily degenerate. This allows to define a 
*pseudo-spin 1/2’ and to build Cooper pairs with this pseudo-spin state: replacing 
*spin' by ‘pseudo-spin’ is all that is required to keep everything else unchanged. 

e If the system has an inversion centre, but not necessarily time-reversal symmetry, 
then, at least, one can distinguish between odd-parity and even-parity states. 

e If there is no inversion centre, but time-reversal symmetry, one can still build 
Cooper pairs; however, there is no such distinction any more between even- or 
odd-parity states. Abusively, one can say that singlet and triplet pairings are mixed 
together. Experimentally, large upper critical fields outpassing the paramagnetic 
limit are commonly found for such systems. 


6.7.1 Spin-Orbit and the Superconducting Order Parameter 


However, there is also another issue for ‘triplet’? superconductors with spin—orbit 
interaction: we are now speaking of spin-orbit coupling between the spin and orbital 
parts of the Cooper pairs, as done in the case of superfluid ?He: the problem is that 
the ‘atomic-scale’ spin—-orbit coupling can be very large (see, for example, what 
happens in the *He nuclei!), whereas the coupling between the total orbital moment 
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of the Cooper pair (an object of a coherence length scale) and the total spin of the 
Cooper pairs can be much weaker. And in real solid, it is very difficult to either 
calculate (predict) or to measure this spin—orbit interaction. This question is very 
important because it determines the symmetry group which has to be considered for 
the classification of the different superconducting states. If spin-orbit is weak, the 
relative orientation of spin and orbit should be decoupled: so, for example, one can 
imagine that the spin could reorient ‘freely’ under the action of an external field. 
If spin-orbit is strong, the orbital state (the gap nodes for example) expected to 
be pinned on the crystal lattice will prevent such a reorientation of the spin state. 
Therefore 


If the spin-orbit interaction is weak, the symmetry group considered for the clas- 
sification of the possible superconducting states will be GQ U(1) & 7 & Rs, 
where Rs are the (3D) rotations in spin space, G is the crystal point group, U (1) 
the gauge symmetry (always broken in the superconducting state) and 7 the time- 
reversal symmetry. Due to Rs, d should reorient under field to minimize the 
Zeeman energy. 

If the spin-orbit interaction is strong, the symmetry group considered for the clas- 
sification of the possible superconducting states will be G & U (1) & T, meaning 
that the d-vector is expected to be ‘pinned’ on the lattice. In such a case, addi- 
tional spin anisotropy may appear in the superconducting state, possibly detected, 
for example, by an anisotropy of the Knight-shift reduction below Tsc, or by an 
anisotropic paramagnetic limitation. 


For most of the candidate p-wave superconductors, determining what is the best 
of the two limits for the description of the system remains an open issue (see, for 
example, the discussion on UPts in Sects. 6.8.2.2 and 6.8.2.3). 


6.7.2 Anisotropy of the Susceptibility for the Strong 
Spin—Orbit Case 


Experimentally, an important question when analysing the behaviour of a potential 
triplet superconductor is the Pauli depairing effect and its anisotropy on the upper crit- 
ical field, or equivalently, the anisotropy of the change of the Knight shift below Tsc, 
both of which depending on the Cooper pairs spin susceptibility. Supposing that we 
are in the strong spin—orbit limit, the question is to derive from the possible order 
parameters, in which directions there will be no change of the electronic spin sus- 
ceptibility between normal and superconducting phases, and in which directions, if 
any, there will be at least a partial suppression of this spin susceptibility. 

As a matter of fact, itis important to realize that even spin-triplet superconductors, 
whatever the spin-orbit regime, can present a reduction of the susceptibility for all 
orientations of the magnetic field. We will see below (Sect. 6.8.1) that superfluid ?He 
realize, in its B-phase, an A,, state for which d œ k. This means that, on each point 
of the Fermi surface, the order parameter is described by a pure |S, = 0) state if the 
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quantization z-axis is taken along k. Such a |S, = 0) state is equivalent to the spin- 
singlet case as regards susceptibility, leading to a vanishing susceptibility. In fact, it 
can be shown that for such an A, state, on average, the susceptibility is reduced to 
two-third of the normal state susceptibility at T = 0, as if for a given field direction, 
one-third of the spins were in the |S, = 0) state [1]. 

At the opposite, for an ESP state, where Cooper pairs are formed only with spins 
of the same direction, we expect no change of the susceptibility for fields along the 
quantization axis. However, this does not tell us what to expect in the perpendicular 
directions. 

Maybe the easiest way to understand if the spin susceptibility is reduced or not for 
a given field direction, and whatever the order parameter, is to rewrite the d-vector 
with the new quantization axis in this field direction, and check in this representation, 
whether or not the z-component of the d-vector (corresponding to the amplitude of 
the |S, = 0) state for that direction) is zero. Changing the quantization axis, and 
rewriting the d-vector for this new quantization axis, amounts to rotate the reference 
frame, or rotate in the opposite direction the d-vector in spin space (see Sect. 6.5.1). It 
is easy to see [see (6.11) for the rotation of the d-vector] that the z-component of the 
d-vector, when changing the quantization axis for the x- or y-axis, is, respectively, 
—d, or d,. What it means is that, in the case of strong spin—orbit coupling, where 
the d-vector cannot reorient depending on the field (H) direction: 


e The |S, = 0) component of the order parameter, where z is the field direction, is 
proportional to the d-vector projection along the field direction [which generalizes 
(6.24), which had a physical meaning only for real d-vectors]: if (d- H) is non-zero, 
there will be at least a partial suppression of the spin susceptibility and so, Pauli 
depairing for the upper critical field, for fields applied in this direction. 

e Whatever the d-vector, there is always at least one direction, where there will be 
Pauli depairing (otherwise, d should be the null vector). 


Coming back to the question of ESP states, if the phase between the A* and A‘ is 
constant on the Fermi surface and it is a unitary state, on top of the quantization axis, 
there is another direction (hence a whole plane) for which there is no change of the 
spin susceptibility (and no Pauli depairing) and a perpendicular direction for which 
the spin susceptibility is completely suppressed (see Sect. 6.11.5). For example, if d 
is of the form (6.26) 
3C-AT EAT) 
d=wy|-5( At+A‘) 
0 


with At = AV, then there is no Pauli depairing for fields in the x-z-plane and full 
Pauli depairing (as in the singlet case) for field along the y-axis. If the ESP state is 
non-unitary (|A‘| 4 |A*| on some part of the Fermi surface), then for sure, there is 
at least partial Pauli depairing in the two directions perpendicular to the quantization 
axis and still no Pauli depairing for fields along the quantization axis. 
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6.8 d-vector Representation of Some Known (or Suspected) 
p-Wave Superconductors 


6.8.1. Phases of Superfluid *He 


?He has been the very first case for p-wave superconductivity (superfluidity, to be 
precise!), and itis, beyond contest, a true paradigm for this state of matter. The reasons 
are that, despite its very low superfluid transition temperature (~ 1 mK on the melting 
curve), the spin state and most of the superfluid properties could be identified and 
studied with tremendous precision thanks to nuclear magnetic resonance (NMR): in 
superfluid *He, the Cooper pairs spin is the nuclear spin of the ?He atoms, which are 
directly probed by NMR. Moreover, the system is rotationally invariant, so with the 
simplest (spherical) possible Fermi surface, and with spherical harmonics as basis for 
the irreducible representations of the superconducting order parameter. In k-space, 
for a p-wave state, they read 


^ 3 a E ^ 3 a ^ ^ ^ 
Yık) = a (k, + ik); Yi ado = "E —ik,) ; Yi) = — 3k. . 
(6.32) 
It can be explicitly checked that [see (6.22)] 


A ^ 1 
k. 5 Lx Yim = k; k (k A -Vk Yim) =m Yim x (6.33) 
1 


Another reason for which superfluid ?He is a paradigm of p-wave superconductors 
is that it presents a rich phase diagram (shown in Fig. 6.3!), with three well-identified 
phases: two (called A and B) in the temperature—pressure plane, and an additional 
(A1) phase under magnetic fields, which are key references. A main topic these days is 
that of the topological properties of some superconductors, which has been addressed 
in great detail some tens of years ago for superfluid *He [5]. Let us examine them 
quickly. 


6.8.1.1 B-Phase 


The B-phase of superfluid ?He is simply characterized by 


d(k) = k ; or explicitely 


Ido) = v (Che + ik) tp + & +i) + BN) +14) - 
(6.34) 


! Adapted from ‘Heliums egenskaper vid låga temperaturer’, P. Berglund, Kosmos 1988, s. 63 
(Courtesy of the Swedish Physical Society). 


6  p-Wave Superconductivity and d-Vector Representation 183 


A 
ER 


30 
30 » N 
20 10 S 
10 S 
3 
1 


Fig. 6.3 Phase diagram, in the temperature—pressure—magnetic field space, of the superfluid phases 
of ?He. Three different phases, called B, A and A4, corresponding to different symmetries and d- 
vectors have been identified. The gap structure is also shown, represented by the distance between 
the inner sphere (the Fermi surface) and the exploded view of the outer surface. There is a uniform 
gap in the B-phase, and a nodal gap (with two nodes at the poles) in the A-phase. The A, phase, 
which appears only under magnetic field, is non-unitary and the gap is like that of the A-phase on 
the majority spin Fermi sheet, and zero on the other (only half the Fermi surface is paired) 


As d*(k) = d(k), this state is unitary and has both (S) = 0 and (L) = 0. Moreover, 
(d (k)| = 1, so that the gap is uniform on the Fermi surface, even though the average 
of d(k) is zero. 

In the simplest models, this B-phase should be the state of lowest free energy, 
notably due to the fact that the gap is fully open over the whole Fermi surface. 
However, this state has a reduced spin susceptibility deep in the superfluid state (see 
the discussion in Sect. 6.7.2). As the pairing mechanism involves spin fluctuations, 
this may be unfavourable compared to other states, notably ESP states, where such a 
reduction of the susceptibility is absent (this is the so-called ‘feedback’ mechanism). 
Therefore, as seen in Fig. 6.3, another phase, the A-phase, is stable notably along 
the melting line and becomes the dominant phase under field. As will be seen below, 
this A-phase is indeed an ESP state and it has a nodal gap structure. 
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6.8.1.2. A-Phase 


The A-phase of superfluid ?He is simply characterized by 


3. ^ 
d(k) = [3 + ikz, 0, 0) , 


a 3. on (6.35) 
|Y (k)) = EG Tdk)] 11)—1 44 - 


So, in the A-phase, the excitation gap vanishes for k, = k, = 0; as shown in Fig. 6.3, 
it has two nodes on the poles of the Fermi surface. Moreover, this ESP state is unitary, 
as (S) = 0 (since d* (K) A d(k) = 0). But (L) is non-zero. In fact, the orbital state 
is selected (by dipolar coupling), so that d and (L) are either parallel or antiparallel. 
Following (6.28) 
h (dQ,. a a ; dQ 

1) = E Fk CIO (ey eie) d Tod +12), 

(ys T TR, ~ ik.) (ici, ike, + f, (ez — iey)) , 

(L) ~h $- a (@ + Kex + (EY (e, - iey)) 

(L) = hex . (6.36) 
The physics of this phase is very rich, notably when considering the weak coupling 
between the orbital and spin moments due to spin-orbit interaction, the existence of 
spin currents, the chirality of the excitations close to the nodes... Again, the review 
by A. J. Leggett [1] is a seminal paper. 
6.8.1.3 A, Phase 
The A, phase appears under field with only one spin direction paired: it has the 
same orbital moment but in addition also a finite average spin. If we keep the same 


convention for the normalization of |W(k)) and d(k), despite the fact that only half 
the Fermi surface is paired, we get 


[Wk)) = —v 3, + ik] 10), 


d(k) = Bd + ik,), —i (k, + ik,), 0) . (6.37) 


For this A; phase 
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(S) = ing Tp (ded — dde, 


dQ T" dQ, 95 
=2h a; hy tik e;/2 a; 6 t E) (6.38) 


=he,, 
(L) = fe, . 


Its stability arises from the fact that when the Fermi surface is polarized, the density 
of states increases with kp, and from the fact that, in ?He, the spin-orbit interaction 
is very weak. So, the up-spin and down-spin Fermi surfaces are almost completely 
decoupled and the largest Fermi surface may have a larger transition temperature 
than the other. Hence, the stability range of this A; phase grows under field (see 
Fig. 6.3). 

As we shall see, the situation should be completely different in uranium-based 
ferromagnetic superconductors, where such a phase is very unlikely due to the cou- 
pling between the different Fermi sheets induced by spin-orbit interaction: like for 
most multigap superconductors, in such a case, there is a unique transition temper- 
ature, even if the different gaps may have different sizes. A possible very singular 
exception will be discussed in Sect. 6.10. Coming back to *He, the A, phase is the 
paradigm of a non-unitary state, with a finite value of (S), a vanishing gap on one 
Fermi surface, and a nodal gap (axial gap) identical to that of the A-phase on the 
other Fermi surface. 


6.8.1.4 Planar and Polar Phases 


Some other states may also be favoured in *He, due to peculiar constraints [lower 
dimensions, aerogel (disordered) background, ...]. These are in any case useful ref- 
erence states for the more complicated cases of superconductors in crystal lattices. 
Notably, there is the planar phase and the polar phase, which are derived from the 
B-phase. The planar phase is defined by 


d(k) = Ee ky, 0), 


; (6.39) 
Ide) = wi (ke +i) t +e ibn 1) . 
Conversely, the polar phase is defined by 
d(k) = V3(0, 0, £.) , 
S i ) (6.40) 


Iw &)) = YVIR ANII. 
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These two states have also (S) = 0 and (L) = 0; however, the planar state has point 
nodes along the z-axis whereas the polar state has a line of nodes on the equator. 
Both are also unitary, ESP states (see Sect. 6.5.3). 


6.8.2 UPt3 and Sr2RuO4 


UPt; is a ‘heavy fermion’ metal, meaning that it is an inter-metallic system with 
very strong electronic correlation effects, leading to a strong renormalization of the 
effective mass of the electronic quasiparticles. It has been the first heavy fermion 
where these effective masses have been directly measured (up to 160 times the free 
electron mass) on the different Fermi sheets, by quantum oscillations, and it has also 
been the first superconducting system (after superfluid >He) where phase transitions 
between different superconducting phases have been observed (see [6] for a review 
and Fig. 6.4). 


6.8.2.1 Phases of UPt; 


The reasons leading to these phase transitions and the nature of the various supercon- 
ducting phases have been the subject of many different proposals. There is a global 
consensus that superconductivity in UPt should be triplet (odd parity). Neverthe- 
less, many questions remain without a definite answer. A first (still open) question, 
for example, is whether or not the spin component is free to rotate in the hexagonal 
crystal lattice of UPt3;. This will determine the response of UPt3 under the application 
of an external field when it is superconducting. The orbital part (the k-dependence) 
of the superconducting order parameter is constrained by the broken symmetries in 
the superconducting state inducing, for example, nodes of the order parameter and so 
of the gap in some particular directions: if spin-orbit coupling is strong enough, then 
the d-vector is expected to be pinned in some crystal direction; if spin-orbit coupling 
is weak, as in superfluid ?He, the d-vector should be free to rotate and the response 
to a magnetic field should have the same anisotropy as in the normal phase. Because 
pairing is mainly driven by the 5 f electrons, spin-orbit coupling is expected to be 
strong also for the Cooper pair wave function, and pinning of the d-vector is likely. 
However, this hypothesis has no definite experimental support (see Sect. 6.8.2.3). 


6.8.2.2 E>, Representation 


Among the models assuming such a strong spin-orbit coupling pinning the d- 
vector in a fixed crystallographic direction, the so-called E», representation has 
been strongly developed. It is an ‘f-wave’ order parameter, which can have various 
symmetries (six basis functions are necessary to describe the most general order 
parameter). Among these, the most successful [7] proposes a d-vector with some 
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Fig. 6.4 Phase diagram, in the temperature-magnetic field space, of the superconducting phases of 
UPt3. Three different phases called A, B and C corresponding to different symmetries and d-vectors 
have been identified. The gap structure is also shown for the different phases, like for superfluid 
?He in Fig. 6.3, as proposed for the ‘E>, model’. This E»,, model is coherent with results from 
thermal transport and upper critical field measurements, but not with NMR measurements of the 
Knight shift (see Sect. 6.8.2.3) 


accidental restrictions (like the fact that the d-vector would only have components 
along the hexagonal c-axis); with these restrictions, it matches numerous experimen- 
tal probes. 

d(k) = (g4(T)2k,k,k, + ec(T)k. ( — KD) e; . (6.41) 


In the A-phase, gc (T) = 0, in the C-phase, a (T) = 0, and in the B-phase, at 
low temperature and low field, g4(T) = +i gc(T): d(k) = e(T)k;(k, + ik, )?e;. 
For this model, all phases are unitary, but the B-phase is chiral, with a non-zero (L). 
Muon experiments or recently polar Kerr effect [8] could have detected such a chiral 
component. 

With its pinned d-vector, in the A- and C-phases, the spin component is zero along 
the c-axis (d || e.), and for any field direction in the basal plane, the order parameter 
behaves as an ESP state (see Sect. 6.5.3). So the Pauli spin susceptibility should be 
suppressed (like for a singlet superconductor), whereas it will be unchanged in the 
basal plane. This feature, which guided the choice of this E», representation, can 
explain the famous ‘crossing’ of the upper critical fields (H,2) of UPt3[9] along the 
basal plane (no Pauli limitation of H5) and the c-axis (Pauli limitation of H,2). 
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6.8.2.3 Eiu Representation 


However, NMR experiments seem to be in contradiction with this interpretation of 
the crossing of the upper critical fields of UPt3. Indeed Knight-shift measurements, 
which are the closest to a measure of the spin susceptibility in the superconducting 
state, found no change in the superconducting state for fields applied along the a 
or b directions in the C-phase, but also no change for field along the c-axis except 
at very low field deep inside the B-phase [11]. Therefore, other models have been 
proposed for UPts, which are much closer to the situation of superfluid ?He, with a 
weak spin-orbit coupling allowing for a field reorientation of the d-vector as long 
as the field is “strong enough’. Knight-shift measurements can tell nothing on the 
gap nodes, but combining angle-dependent thermal conductivity measurements [10] 
with the NMR result, a E;,, scenario has been proposed, predicting a non-chiral state 
[d(k) « cg — 1) e; + pec) in the B-phase]. This scenario can also more easily 
give account of some other features of the phase diagram (like the existence of a 
tetracritical point in all field directions): see Fig. 6.5. For the different phases, in this 
model, the d-vector would be as shown in Table 6.1. 

So, as can be seen from Table 6.1, at low fields the d-vector does not depend on 
the field orientation [check the A-phase and B-phase-(low H) lines of the table]. In 
the B-phase, for H || c, there is a field-induced reorientation of the d-vector: at low 
field, with d x (&, ep + [^ ec), the d-vector is not perpendicular to the c-axis (except 
on the line £j = 0) so there is a finite |S, = 0) component of the spin along the 
field. This would be imposed by the orbital part of the wave function and spin-orbit 
interaction or coupling to the small antiferromagnetic moments acting as a symmetry 
breaking field. But for fields above 0.22 T, with d « (ka ep + ky e;), the d-vector is 
real and always perpendicular to the c-axis, so we know that it is equivalent to an 
ESP state in that direction. Hence, the field-induced rotation of the d-vector. In 
the high-field C-phase, where the Pauli limitation could be at play, we note that in 
Table 6.1 the d-vector is always perpendicular to the field direction, so that again, it 
is an ESP state explaining the observed absence of change of the Knight shift (but in 
contradiction with the H,2 anisotropy). Note also that all these features are preserved 
ifi in the B- phase, the d- vector is a complex combination of e, and e,, or e; and e,: 
(ka ep + ik, e); (ka ep x - ik, e;). 

Then, the B-phase would be chiral (as in the E», model), but also non-unitary 
(d(k) ^ d*(k) Æ 0, see Sect. 6.6.3). So was the original proposal in [11]. It is an inter- 
esting example of a non-unitary state with no global spin polarization, e.g. with d œ 
op — D(ka ep + ik, €a), we derive from (6. 25) and (6.31) that (S(k)) x (5k2 — 
Df, fe, and At (k) œ [SK — 1) (ka + b), AY (k) « 1662 — 1) (ka — kl. So 
indeed, averaging over the Fermi surface leads to no net spin and equal averaged 
gap amplitudes for up-spins and down-spins, even though they are different for most 
k of the Fermi surface. 
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Fig. 6.5 Phase diagram in the temperature—magnetic field space of the superconducting phases 
of UPts : the same phases as in Fig. 6.4 are represented, but different gap structures are proposed, 
according to the Ej, symmetry (the Fermi surface is reduced to a point in this representation). 
The magnetic field H is parallel to the lattice vector b. Reproduced from [10] with permission 
(Copyright 2012, American Physical Society) 


6.8.2.4 p-Wave Superconductivity in Sr; RuO; 


This system has also early been proposed as a candidate for p-wave superconduc- 
tivity. In fact, due to the accessible range for the studies (Tsc is slightly larger than 
1K), and the absence of radioactive elements, as well as the popularity of oxides, it 
is certainly the most studied ‘p-wave’ superconductor ever (for a review, see [12]). 
The most ‘fashionable’ order parameter is very similar to the superfluid ?He A-phase 


d(k) x (ky + i ke; . (6.42) 
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Table 6.1 The d-vector for the various phases and magnetic field orientations in the E,, model of 
UPt3, as proposed in [10, 11]. The distinction ‘low field’ or ‘high field’ is meaningful only in the 
B-phase for H || c, and is suggested by NMR measurements which show a decrease of the Knight 
shift in the whole B- and A-phases for H || b, and for H || c in the B-phase only, and for fields 
lower than 0.22 T. Above this value, the d-vector would rotate and the change of the Knight shift 
disappears (see [11]) 


Phase Hija H || b Hic 
A (5k2 — 1) (ka ey) (5k2 — 1) (ka ep) (5k2 — 1) (ka ey) 
C (5k2 — 1) (kp ec) (5k2 — 1) ec) (5k2 — 1) (kp ea) 


B (low H) (5k? — D) (ka ep + koe.) | Gk2 — D (kae - kpec) | (Sk2 — D) ka ep + kp ec) 
B (high H) (5k2 — 1) (ka ep + ke)  |(Gk2 — Dk eg -kpe;) | (5&2 — D (ks ep + ky ea) 


Itis a unitary chiral state with (L) = +A e, and point nodes along the c-axis. However, 
due to its quasi-2D Fermi surface, there is no k vector on the Fermi surface at the 
node position. There are many contradictory experiments on this system and several 
have claimed to have detected or refuted the time-reversal symmetry breaking in this 
compound (this is also true for the B-phase of UPt3). So today, there is still no firm 
conclusion on whether or not (6.42) is the correct gap symmetry for Sr; RuO,, and 
even on whether or not it is really a p-wave superconductor. Indeed, the most recent 
NMR studies corrected previous results and demonstrate now that the Knight shift 
does decrease in the superconducting phase when the field is applied in the basal 
plane, ruling out one of the strongest support for an order parameter of the above 
form [13, 14]. 


6.9 Ferromagnetic Superconductors 


Since 2000, three systems with a true homogeneous coexistence of ferromagnetic 
order and superconductivity have been discovered; all of them are uranium based. 
The first, UGe» [15], is only superconducting under pressure, the other two, URhGe 
[16] and UCoGe [17], are superconducting at ambient pressure. In the three cases, the 
same 5 f electrons from the uranium ions are responsible for the ferromagnetic and 
the superconducting orders, and the Curie temperature (Tcurie) is always larger than 
the superconducting transition temperature (Tsc). Intuitively, these two orders seem 
antagonistic, as it is known that superconductivity is suppressed by large fields. 
However, it is important to be more precise in order to understand why and how 
ferromagnetism and superconductivity might coexist. 

The first point to have in mind is the two kinds of magnetic fields associated 
with ferromagnetic order: there is an internal magnetic field B;,, also called the 
‘dipolar field’, arising from the spontaneous magnetization in the sample (B ~ M, 
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Table 6.2 Orders of magnitude of some important parameters, including an estimate of the inter- 
nal (dipolar) magnetic field Bint (coming from the spontaneous magnetization) and the effective 
exchange field Bexc, in the three known uranium-based ferromagnetic superconductors. For UGe», 
which is superconducting only under pressure, we have indicated the Curie temperature and the 
ordered moment Hora at the pressure of 1.2 GPa, where Tsc is maximum. For Bexc we only give a 
lower bound deduced from the value of the Curie temperature 


UGez (1.2 GPa) URhGe UCoGe 
Tcurie 35K 9.5K 25K 
Tsc 0.8K 0.25 K 0.5K 
Lord ~ | ug 0.4 ug 0.05 ug 
Bint ~ M 0.2 T 0.09 T 0.1 T 
Bexe > 25 TCurie 50T 13T 4. 5T 


if one neglects demagnetization effects) and there is the exchange field B,,. which 
is a very short range effective magnetic field, acting only on the electron spins, and 
arising from the Coulomb interaction and the exclusion principle. This exchange 
field appears in a mean-field treatment of the spin-spin exchange interaction term. 
These two fields have very different orders of magnitude. The first is rather small in 
these systems, owing to the weak ordered moment (see Table 6.2); indeed, the three 
compounds, when they are not in the itinerant limit, remain close to it, so that this 
internal field is in any case much smaller than the (large) orbital upper critical field. 
However, the exchange field, whose scale is fixed by kg 7curie/[tB, is much larger 
than the Pauli paramagnetic limit (of the order of 2kg Tsc / jg). Table 6.2 reports the 
values of these fields for the different compounds; a recent review has been published 
in [18]. 


6.9.1 ESP States 


In these uranium-based ferromagnetic superconductors, superconductivity sets in 
below Tcurie, so that Cooper pairs are formed from a spin-polarized Fermi surface. 
Intuitively, one can guess that if the polarization is large enough (typically, if the 
difference in the Fermi wave vectors is larger than the inverse coherence length), this 
leaves little choice but to form Cooper pairs ‘independently’ on the Fermi sheets with 
different spin orientations (see Fig. 6.6). In other words, the strong exchange field 
present in the uranium-based ferromagnetic superconductors seems only consistent 
with an odd-parity (triplet) superconducting order parameter. Moreover, in case of 
large polarization of the bands (compared to A), one expects only ESP states to be 
favoured. Choosing the quantization axis along the easy axis 


[W) = AT] tt) + AY LL). (6.43) 
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Fig. 6.6 Scheme of the 
spin-dependent density of 
states in a ferromagnetic 
metal. If superconductivity 
develops on spin-polarized 
Fermi surfaces, due to the 
difference of wave vectors at 
the Fermi level, mainly 
up-up and down-down 
Cooper pairs can be formed 
(short arrows). This leads to 
an ESP state with different 
weights for the majority and 
minority spins, so to a 
non-unitary ESP state 


In such a case, the d-vector would have the general form: 


1 
d, = —(-A' + A‘) 


M 


Er 


vy 
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N(E) 


2 
-i 6.44) 
dy TA + AY) 
d. =0 
Then (S) at a given k is 
(S) =i h(d ^ d*) 
= a (At + A%)(A™ -- AV) 4 AT+ ANCA™ + AP) e, 
(ATA — [A* P) 
= h——\——r &: . 
(1At? + A?) * 
(6.45) 


N(E) 


This has been already seen when discussing ESP states [see (6.27)]. It is a very 
natural result; for an ESP state, there is a finite spin (and non-unitary state) if and 
only if the weights of the | 11) and | ||) components are unbalanced. Conversely, 
for a ferromagnetic superconductor with at least partial band polarization, one does 
expect to have such a non-unitary state, which should be an ESP for a ‘strong enough’ 
exchange field. It could also be chiral, but it should be at least non-unitary. Before 
that, letus see what are the possible order parameter from group theory considerations 
for orthorhombic systems [19]. 


6  p-Wave Superconductivity and d-Vector Representation 193 


6.9.2 Symmetries 


Only two one-dimensional representations are left, called A and B, due to the low 
orthorhombic symmetry. In the paramagnetic state, the first one (A) looks very much 
like the B-phase of superfluid ?He 


d4(k) oc us k,e, + uykyey + ucke; , (6.46) 


where ux, Uy, u; are real functions of k with full orthorhombic symmetry. So in the 
paramagnetic state, we start from 


AT = —u,k, + iu yk, 
Ay = turk, +iuyky . (6.47) 
AT = uk, 


In the ferromagnetic state, the amplitude (and possibly the phase) of the A\ and 
Ay components can differ. So the order parameter should read 


Al =n" (-usky + iuyky) = —ntky +intk, 
AJ = gH (Husky + iuyky) = n1, + intk, (6.48) 
AS = nou-k, = nok. 


with no phase difference between the complex amplitudes n, nt, n°. In terms of 
d-vectors 


1 t 1 A " 0 
d = —[-A'"(f)(e, + ie) + A* @)(ex — ie,)] + Ave, 


2 
ub aei — in! E, (ex + iey) + Q1, + intky)(ex — iey)] + nte: 
ex e es i ;e 
z; [et n) - (nf mp) 
+ [(n! +n) ky +i(n! — nl) & e, y + 2nvkee.| : 
(6.49) 
Similarly for the B-phase, the order parameter reads 
AT = T 
Apes (6.50) 


A5 = Ok. + ico, 
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1 A : z ] 0 
d= —([-A' (&)(e, + iey) + A (&) (e; — iey)] + Ae. 


2y 
1 f : i . ^ : ^ 
aj (Sake) (e, + iey) + GI) (es — iey)] + (Pk, + ity k,)e: (6.51) 
Se [c -t)&e. -i(ct - c he, +2 (cok. + if) e.] . 
2y z z z z M MA z 


In the general case, neither the A- nor the B-phase have symmetry-enforced nodes. 
However, if an ESP state is enforced by strong band splitting, meaning that the A? 
component vanishes in (6.48) and (6.50), then 


e the order parameter of the A-phase vanishes for k, = ky = 0: the A-phase has 
poles on the z-axis and 

e the order parameter of the B-phase vanishes for k, = 0: the B-phase has a line of 
nodes on the equator. 


This is correct, but maybe more important insights on these ferromagnetic supercon- 
ductors can be learned, notably on the relationship between up and down components, 
from a more general microscopic model [19]. The equations look unfriendly at first 
sight, but at the end, a nice physical picture emerges. 


6.9.3 Microscopic Model 


For these ferromagnetic superconductors, all models start from the same pairing 
interaction, supposed to arise from the magnetic interactions: this was already pro- 
posed in the de Gennes book on superconductivity ([20] page 104)! So they start 
from a Hamiltonian 


1 
Hin, = — shal J Prdr'S m) -— rS), (6.52) 


where / is an exchange constant and y;; the medium magnetic susceptibility (matrix). 
The models differ notably on the expression for this susceptibility. However, starting 
from such an Hamiltonian, the derived gap equations necessarily couple the different 
components of the order parameter. In the case of ferromagnetic superconductors, 
a peculiarity found in all systems is that the susceptibility has a marked uniaxial 
anisotropy (Ising type), so that if z is the easy magnetization axis, x,, will be a 
dominant term in the susceptibility matrix. 

In the following, we will just present and comment the linearized gap equations 
(in the weak-coupling limit) to show where the approximations come into play and 
what are the physical consequences. To understand the derivation of these equations, 
please refer to [19]. The gap equations (at Tsc) read 
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Atk) = -T Y XIV GIGA (k) 
n k’ 
+ Vt gg A (k^ + vt? ca* a + G* G) APQC)] 


Ak) = -T Y Vv! GIGA R’) 
n k’ D 
+ VH GG AK + v9gtat + G* G) AT(CO)] 
Mk) = -T $ M "IV"! G! G' AT k’) 
n k’ 
+ V9 Gt G! A (K^ + V9 (G*g* + GT G') AT(KC] 
(6.53) 
where the different interaction terms V^? are expressed from the susceptibility by 


Vit = uh P xe ; VW = uh P y 
VN = -uP (xh, — x ixi) : V = ua O — xy + 2i) 
yf? = -up (x? ix + 2ixk)s V = -uP Ox —ixy, + 2x) 


sis FX = Kez 
2 


y? — Lal 
(6.54) 
In the above equations, 


VË = V*P(k, k') and xi = x" (k, k’) = 1 (xij(k — k’) — xij(k + K’) . 


Strong band polarization (like that due to an ‘exchange field’) much larger than 
Tsc means that we can cancel all terms with Green functions arising from bands of 
opposite spins (G* Gt-type terms). However, (6.53) shows that this is not enough 
to ensure an ESP state with A°(k) = 0. Indeed, the different components are all 
coupled together as a multigap system and a non-zero A°(k) can be induced by the 
At, A? components 


Atk) = -T 5 DIV G'G'AT(O) E VH GTGT AT (K] 


n k 

Atk) =-T $ J IV"G'G'AT(Ó) +VEGGAK)) | (ess) 
n kK 

Ak) = -T 5 J IV? G GT ATQ) + VG GAE’) 
n k’ 


To go further, we need to make approximations based on the characteristics of the 
susceptibility. All non-diagonal terms of x;; are zero at k = 0, but, in principle, can 
be finite at finite k. In a Landau framework, they would arise from gradient terms 

ðM; 


of the form xj» SO from spin-orbit coupling. If the spin—orbit coupling is weak 


enough, we can further neglect these terms, then V1? and V? are suppressed and 
we do get an ESP state: A°(k) = 0. However, in uranium-based systems, spin-orbit 
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coupling is usually considered as 'strong' and most models suppose that the d-vector 
has a fixed direction, imposed by the orbital part of the order parameter and ... spin— 
orbit coupling. Therefore, considering these systems as pure ESP states is probably 
only an approximation: spin-orbit coupling most likely induces a (small?) A? finite 
component, even with strong band polarization! But this is not the only surprise 
which emerges from these microscopic equations. Even if we suppose that A? — 0, 
another counter-intuitive result emerges. The equations in the ‘ESP approximation’ 
are written as 


Atk) — -T» J IV G'G'ATQ) + VH GT GT AT (K] 


n k’ 
(6.56) 
Atk) = -T 3 SIV Gtatatk’) +V GG AH 
n k’ 
with A 

vit= yH = —-pl xt 

N M u (6.57) 
y = Wit = pe Pk - x») 


It corresponds to the equations of a two-band superconductor, with intra-band cou- 
pling controlled by the longitudinal susceptibility x^ and inter-band coupling con- 
trolled by transverse susceptibilities. 

For such ESP states, the possible order parameters of ferromagnetic supercon- 
ductors are the above-mentioned A or B states, with d-vector 


d, = E {[(t +n!) ky — i (n? — n?) | e 


k 
+| (nt +n) ky ei(nd n) & let. 
Tr 1s — Ne) TD (6.58) 
) 


z; (e: — nt) ke, —i(n! +n} 


dz = 


At the same level of approximation, equations for ny, ny are decoupled. So, for 
both the A and B states, the equation for the largest Tsc is that of a two-band 
superconductor (where e is a characteristic energy) 


1 
Tsc = € exp (--) ; 
$ 


2 
T ot (6.59) 
EL CE aa 


2 ii 4 


6 p-Wave Superconductivity and d-Vector Representation 197 


Here, gf) a VIDI ox x22, gl! oc V1 V œ (Xxx — x,,). Equation (6.59) shows 
that, as for any two-band superconductor, Tsc should increase if the inter-band cou- 
pling is increased. In this case, Tsc should increase if the difference between the 
transverse susceptibilities increases. This is surprising for two reasons. 

The first is that it was believed, since the pioneering work of D. Fay and J. Appel 
[21], that Ising anisotropy was most favourable for ferromagnetic superconductors 
because transverse fluctuations would be pair-breaking as they would force scattering 
from one Fermi sheet to the opposite polarization Fermi sheet. However, this paper 
was written before the discovery of superconducting MgB; and the following boost 
of work on multigap superconductivity: we understand now that these transverse 
fluctuations do also induce exchange of Cooper pairs from one Fermi sheet to the 
other, which is favourable to superconductivity. And so the prediction is that 2D 
anisotropy (rather than uniaxial anisotropy) is the most favourable for ferromagnetic 
superconductors (maximizing both xz; and Xxx — Xyy). 

The second reason is that, experimentally, all the systems where ferromagnetic 
superconductivity has been discovered did show a strong uniaxial anisotropy, con- 
firming the prediction from [21]. But making statistics on few elements is always 
dangerous. We also found that reducing this uniaxial anisotropy in URhGe, using 
stress along the b-axis which increases the xp, susceptibility without changing Xec 
(for stress below 0.6 GPa), does increase Tsc in URhGe: a factor 2 between 0 and 
1 GPa [22]. Ising anisotropy is probably not the most favourable, and larger Tsc 
ferromagnetic superconductors might be awaiting to be discovered. 

Coming back to the consequences of (6.58) in terms of order parameter, with 
decoupling of the equations for 7, and 7,, the order parameter should have a line of 
node, (kx or ky or k; = 0), a finite (S) of order (at given k) 


8\ K 
ee (5) Le) M 


onin, p= tm 
2 j 2 j 


(6.60) 
ô 


But it is not chiral ((L) = 0). The A state could be chiral if 7, and n, remain coupled 
(in this model, if x, 4 0), with 


(Q0 Gym + deny) (KÈ KD 
i= "a + 822) 4-8)» x 


Note that in principle too, if such is the case, d is probably not an ESP state any 
more, meaning that the Ag component should be non-negligible and all expressions 
should be much more complex. 
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6.10 UTe; 


The last discovered p-wave superconductor is again an uranium-based system, also 
orthorhombic, and again close to a ferromagnetic instability but not ferromagnetic: 
UTe;. This time, the Tsc is even more accessible: between 1.4 and 1.6K from bulk 
measurements depending on the samples [23, 24]! Moreover, it presents similar 
astonishing field-reinforced superconductivity [23, 25, 26], with an absolute record 
(for such a low-Tsc system) of an upper critical field higher than 60 T [26]! An 
interesting point concerning the possible d-vector for such a paramagnetic system 
is the observation, on all samples, of a finite residual term of about half the normal 
state value of the specific heat C/T. The origin of this term is still unsettled, but an 
interesting proposal was that it would arise from a state similar to the Aj, state of 
superfluid ?He (see Sect. 6.8.1.3). 

Naturally, this can only happen if spin-orbit coupling is weak enough (otherwise, 
it would not be possible to form Cooper pairs on one Fermi surface and not on the 
other), and in this case, indeed, group theory classification leads to the possibility of 
such states (see Table 1 in [27]) 


d(k) = (1, i, 0)g(k) . (6.62) 


Moreover, because this is for a weak spin-orbit case, any rotation of the d-vector is 
a possible order parameter. According to (6.17), this means that the order parameter 
would be simply A (k) = At q(k), with no other component. In the case of UTe», the 
largest susceptibility axis is the a-axis, so the quantization axis should be along a. 
In such a case, the Fermi surface with down-spin would remain unpaired, explaining 
the residual specific heat term. 

Another consequence of such an order parameter is that it is non-unitary, with a 
finite spin for the Cooper pairs (along the a-axis). For the total system, this spin of the 
Cooper pairs would be compensated by that of the unpaired electrons (from the down- 
spin Fermi surface). This might lead to a total null magnetization. Nevertheless, in 
such a case, there is no reason that the state with such a d-vector would be stabilized, 
as half the condensation energy is lost compared to any other state, where pairing 
occurs on Fermi sheets of each spin direction. For such a state to be favoured, one 
needs some advantage of having this spin polarization in the superconducting state, 
so, forexample, a form of coupling between the spin Cooper pairs and the normal state 
magnetization [27]. Then, this would also induce, like for superfluid ?He in the A; 
state [28], a finite magnetization when entering the superconducting phase; globally, 
it is as if the system would become ferromagnetic on entering the superconducting 
state, with a (weak?) magnetization increasing linearly with temperature below Tsc 
[27]. Up to now, this has not been detected, and it remains to be settled if this (rather 
improbable) hypothesis is valid or not. 

In any case, this last system beautifully confirms that p-wave superconductors 
are an incredible playground, where almost every new system brings its own share 
of surprise and stimulating challenges. 
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6.11 Proofs and Exercise Solutions 


6.11.1 Proof of the Cayley-Klein Relation 


Proof 


Ro (a-0)'R. 9 
- (cos Q/21-—i sin Q/2 $a) (a-0) (cos 92/21 +i sin 2/2 $c) 


E (cos Q/21 — i sin Q/2 $«) | cos Q/2 (ac) 


+i sin Q/2 (a .$)1 4 i(a ^ %)-0) | 
= cos? Q/2 (a-o) + i cos 2/2 sin Q/2 (a - Q)1 
— cos Q/2 sin Q/2 (a ^ €) - o 
— i cos Q/2 sin Q/2 (a: $1— i(a^ $c) 
+ sin? Q/2 (a . $y($o) +7 ($ ^ (a^ &$) «) 
= (a. Q)(Q-0) + cos? 2/2 (a = $$) -0 + sin Q (a ^ Q)-o 
— sin? Q/2 (è ^ (a^ $) " 
= (a- $)($-0) + cosQ (a = $$) 0 + sin Q (a ^ €)«o 
= Re(a)-a. 


6.11.2 Conservation of the Scalar Product under Rotation 
with the Definition (6.11) 


Solution 6.1 Let us note that we can write, for any (complex) vectors u, a, b, c, d, 


u = (u- Ê)Q + (u— (u- $9) = ud, + ud. 


R(u) = ud, + R(ud,) = ud, + cos Qud, + sin Q (Ê ^ u) 
ad (bAc) = e" "a bicn = (a-c)b — (a-b)c 
(a ^ b)^ c= (a-c)b — (c-b)a. 


R(d) - Ru) = dy-udy +R(d1)- R(uL) so 
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R(d) - Ru) 
= (d. &(u- €) 
+ cos? 9 [d-u — (d )(u- $) — (u$ €) + (4 $561 | 
+ sin? Q [à (A ($^ 2 
+ sin Q cos Q [d Êr + w- : «dj 
= (d. &(u- €) 
+ cos? Q [d-u — a- Ôu- $)] 
+ sin? Q [d-u - (d. $(u. %)] 
R(d) - Ru) = d-u. 


6.11.3 Conservation of the Cross Product under Rotation 
with the Definition (6.11) 


We want to check if R(u ^ d) = R(u) A R (d). To evaluate R (u ^ d), let us decom- 
pose u ^ d in parallel and perpendicular parts to Q (called (ud) and (ud) | 


— 


u ^ d = (udadi) + (ud); Adi) + (ud); ^ di) — (dj ^ (ud), 
ae ma — —————Ó—M— 
(ud), (ud) 
— [u^ d) 212 + [a DAA a. | z [a- 2)» m 
— m 


~ 
—_—_————————— ŘŘŘŘŘĖ 
Gi (ud), 


— 


R(u ^ d) = (ud), + R ((ud),) 


R((ud)_) 
= cos Q (ud), + sin Q € ^ (ud), 
= u- $)€ ^ [cos 2d + sing ^d] — (a. $$ ^ [cos 2u c sin Ê Au] 
= (u- $)€ ^ R(d) — (d- €) ^ Rw) 
= (R(u)  Q)€ A R(d) — (R(d) - MLA Ru) 
OK E ^ (R(d) ^ Ra) | 


Ru ^ d) = Ru) ^ R(d) + (ud), — (RU) ^ R(d)), 
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(Ru) ^ R); 

= Ru), ^X, 

= (cos 2u, + sing (Ê A u,)) ^ (cos d, + sin (Ê ^ d,)) 


= cos? Q (u, ado) 
+ sin? Q($ Aui) A (QA di) 


+sin2cosQ (u ^($^d) -di A(QA u1) 
= cos? Q (ud), 
+ sin? Q ((@ A u1)-d1) $ 


+ sin Q cos Q (tu.-d: € — (d. €) 


= cos? Q (ud), + sin? Q ((u ^ d) ê) Ê = (ud), 


So indeed, R (u ^ d) = R (u) A R(d). 


6.11.4 Rotation of the d-Vector of a Simple “Up-Up” State 


Solution 6.2 For such a state, from (6.17), we get that 


1 
NIE. 
v\ 9 


Changing the orientation of the (z) quantization axis amounts to a rotation of 6.17 
by x around e,. From (6.11), we get 


Rid) = d,e, — (d —d, ex) 
—-—d, 


which is indeed what we would expect! 


6.11.5 Equivalence of ESP Unitary States and Pure |S, = 0) 
States 


Solution 6.3 Such an ESP state would have only A4 and A, components, equal 
within a phase factor. Let us write A, = g AP A. The d-vector of such a state will 
be [see (6.17)] 
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1 A (e=? — 1) 1 —ie ^ Ay sin Q 
d= T -r (1 +e7#) | = —ie™? A4 cos o 
0 0 
: cos 6 
If we rotate this state by 2/2 around a vector € = | sin | in the (x, y)-plane, we 
0 


obtain from (6.11) 


Ro(d) = (d- 22+ (QA d) 


sin(0 + Q)e Ay cos 0 
= — | sin(0 + g)e * A, sind 
—ie 9 A, cos(0 + Q) 


So indeed, if we choose 0 = —Q, we recover a pure |S, = 0) state. Note that it is the 
phase between the A; and A, components of the order parameter, which determines 
the precise direction of the required 77/2 rotation. 
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